Professional Documents
Culture Documents
This work was jointly supported by the Ministry of Science and Technology under Grant MOST-108-2634-F-006-008 and Grant MOST-109-2628-E-006-010-MY3,
and the Center for Future Computing, Miin Wu School of Computing, NCKU.
ABSTRACT Monolithic 3D stacking of complementary FET (CFET) SRAM arrays increases integration
density multi-fold while supporting the inherent SRAM advantages of low write power and near-infinite
endurance. We propose stacking multiple 8-transistor CFET-SRAM layers on regular CMOS periphery to
achieve an ultra-high-density array for computing-in-memory (CIM). CFET and regular CMOS (FinFET)
devices are measured and calibrated with BSIM-CMG compact model. SPICE simulations are performed
to evaluate the delay of CIM operation, power consumption, and analog computational error due to device
non-linearity. The impact of device non-linearity on neural network inference accuracy is evaluated using
the CIMulator simulation platform. Lower CFET current drive due to amorphous (deposited) silicon
channel is shown to have negligible impact on CIM operational delay in many cases, as the maximum
allowable current is limited by wiring resistance, not transistor drive strength while maintaining accurate
weighted sum. Compared to regular 2D CMOS FinFET array. CFET SRAM cells show an improvement
up to 57.19% in TOPS/W. Furthermore, the performance in TOPS/W mm2 is improved up to 19×. A
factor proportional to the number of stacked layers for monolithically stacked CFET SRAM cells, makes
it highly promising for future edge intelligence.
This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
FIGURE 2. Schematic showing two adjacent CFET 8T-SRAM cells, each has
two CFETs and an average of 2.5 double-layer n-channel nanosheet FET
FIGURE 1. (a) Advantages of SRAM cell in many aspects compared to (NSFETs) (RA2/RA’2 share one NSFET).
other memory devices, while lacking in integration density and
non-volatility [8]. (b) Combining n-channel and p-channel devices into one
CFET [4], and monolithic stacking of multiple CFET layers to improve the
device footprint. (c) Cross-sectional TEM images showing CFET structure
with p-channel on top and n-channel at bottom. Inset showing EDS
mapping of the TEM image for Ti and Hf.
FIGURE 6. (a) Simulated hold static noise margin (HSNM) of 6T-SRAM and
read static noise margin (RSNM) of 8T-SRAM using CFET, (b) HSNM as a
function of Vdd , (C) Critical write current as a function of Vdd with inset FIGURE 8. Components of read and write peripheral circuits placed at the
showing the write N-curve, (d) Critical read current as a function of Vdd bottom of the CFET SRAM stack. Inset shows the discharge current flowing
with inset showing the read N-curve. The above curves are obtained after through read access transistors responsible for MAC nonlinearity.
fine tuning the SRAM cell stability and balancing of read and write speeds.
FIGURE 11. RBL characteristics (θ ) modeling and fitting for FinFET and
CFET device for 4-bit inputs, and weights with the inset showing the MNIST
inference accuracy as a function of epochs during retraining.
MLP neural network described in Fig. 10 requires 17 stacks [2] Y.-D. Chih et al., “16.4 an 89TOPS/W and 16.3TOPS/mm2 all-digital
of CFET SRAM cells of dimension 64 × 60 and 4 stacks SRAM-based full-precision compute-in memory macro in 22nm for
machine-learning edge applications,” in Proc. IEEE Int. Solid-State
of SRAM cells of dimension 256 × 256. Stacking the lay- Circuits Conf. (ISSCC), vol. 64, 2021, pp. 252–254.
ers on top of each other greatly improves the performance [3] J. Ryckaert et al., “The complementary FET (CFET) for CMOS scaling
in terms of TOPS/(W-mm2 ) proportional to the number of beyond N3,” in Proc. IEEE Symp. VLSI Technol., 2018, pp. 141–142.
[4] S.-W. Chang et al., “First demonstration of CMOS inverter and 6T-
stacked layers. However, stacking would incur larger process- SRAM based on GAA CFETs structure for 3D-IC applications,” in
ing costs due to repeated processing steps for every SRAM Proc. IEEE Int. Electron Devices Meeting (IEDM), 2019, pp. 1–7.
stack at the expense of improved performance proportional [5] S. De et al., “Tri-gate ferroelectric FET characterization and modelling
for online training of neural networks at room temperature and 233K,”
to the stacked layers in terms of performance per power per in Proc. Device Res. Conf. (DRC), 2020, pp. 1–2.
layout area. [6] S. Khandelwal et al., BSIM-CMG 110.0.0: Multi-Gate MOSFET
Compact Model: Technical Manual, BSIM Group UC Berkeley,
Berkeley, CA, USA, 2014.
IX. CONCLUSION [7] H.-H. Le et al., “Ultralow power neuromorphic accelerator for deep
learning using Ni/HfO2/TiN resistive random access memory,” in Proc.
Measured CFET and regular CMOS FinFET devices are 4th IEEE Electron Devices Technol. Manuf. Conf. (EDTM), 2020,
accurately calibrated in transfer and output characteristics pp. 1–4.
using the BSIM-CMG compact model. 8T-SRAM cell is [8] W.-S. Khwa, D. Lu, C.-M. Dou, and M.-F. Chang, “Emerging NVM
circuit techniques and implementations for energy-efficient systems,”
realized using CFET technology and SPICE simulations in Beyond-CMOS Technologies for Next Generation Computer Design.
of 8T-SRAM CIM macros are conducted for CFET and Cham, Switzerand: Springer, 2019, pp. 85–132.
FinFET technologies. Evaluation of MAC latency, power, [9] M. A. Baig et al., “Compact model of retention characteristics of
ferroelectric FinFET synapse with MFIS gate stack,” Semicond. Sci.
and non-linearity is carried out, a non-linearity model is Technol., vol. 37, no. 2, 2021, Art. no. 24001.
developed and transferred to a higher-level NN simulator [10] S. De et al., “Ultra-low power robust 3bit/cell Hf0.5Zr0.5O2 ferroelec-
to evaluate system-level inference accuracy. Though FinFET tric FinFET with high endurance for advanced computing-in-memory
technology,” in Proc. Symp. VLSI Technol., 2021, pp. 1–2.
has an advantage in-terms of latency, CFET shows better [11] S. De et al., “Random and systematic variation in nanoscale
performance per power due to shorter SRAM cell height and Hf0.5 Zr0.5 O2 ferroelectric FinFETs: Physical origin and neuromor-
reduced wiring resistance and the efficiency becomes more phic circuit implications,” Front. Nanotechnol., vol. 3, Jan. 2022,
Art. no. 826232. [Online]. Available: https://www.frontiersin.org/
evident for large array sizes. Performance in (TOPS/W mm2 ) article/10.3389/fnano.2021.826232
gets improved by 19× and 7× for 17 and 4 stacked layers of [12] S. De et al., “Robust binary neural network operation from 233 K to
CFET SRAM cells of dimension 64 × 60 and 256 × 256 398 K via gate stack and bias optimization of ferroelectric FinFET
synapses,” IEEE Electron Device Lett., vol. 42, no. 8, pp. 1144–1147,
respectively. Device nonlinearity caused due finite output Aug. 2021.
conductance is recovered by retraining the NN with epochs [13] Gate-all-around field effect transistor having multiple thresh-
as few as 5. CFET being a gate-all-around device has supe- old voltages, by R. Bao et al. (2020, Mar. 10). U.S. Patent
10 586 854. [Online]. Available: https://www.freepatentsonline.com/
rior gate controllability and reduced short channel effects, 10586854.html
making it scalable beyond N3 [3]. In addition, CFET with [14] C.-Y. Huang et al., “3-D self-aligned stacked NMOS-on-PMOS
deposited channel has reduced footprint and is stackable in nanoribbon transistors for continued Moore’s law scaling,” in Proc.
IEEE Int. Electron Devices Meeting (IEDM), 2020, pp. 1–4.
comparison to regular CMOS. [15] J. Singh, S. P. Mohanty, and D. K. Pradhan, Robust SRAM Designs
and Analysis. New York, NY, USA: Springer, 2013.
[16] Q. Chen et al., “Critical current (ICRIT) based SPICE model extraction
ACKNOWLEDGMENT for SRAM cell,” in Proc. 9th Int. Conf. Solid-State Integr. Circuit
Technol., 2008, pp. 448–451.
The authors are grateful to the Taiwan Semiconductor [17] C.-J. Yeh, “Next generation compute-in-memory via monolith-
Research Institute for Nanofabrication facilities, and services. ically integrated 3D-stacked complementary-FET with conven-
National Center for High-Performance Computing for GPU tional CMOS,” M.S. thesis, Inst. Microelectron., NCKU, Tainan,
Taiwan, Jan. 2021. [Online]. Available: http://ir.lib.ncku.edu.tw/handle/
computing facilities. They appreciate PROPLUS Solutions 987654321/204107
for support on software licenses for model parameter [18] P.-Y. Chen, X. Peng, and S. Yu, “NeuroSim: A circuit-level macro
extraction. model for benchmarking neuro-inspired architectures in online learn-
ing,” IEEE Trans. Comput.-Aided Design Integr. Circuits Syst., vol. 37,
no. 12, pp. 3067–3080, Dec. 2018.
[19] Y. LeCun. “The MNIST database of handwritten digits.” 2022.
REFERENCES [Online]. Available: https://ci.nii.ac.jp/naid/10027939599/en/
[1] Q. Dong et al., “15.3 A 351TOPS/W and 372.4GOPS compute-in- [20] S. N. Truong and K.-S. Min, “New memristor-based crossbar array
memory SRAM macro in 7nm FinFET CMOS for machine-learning architecture with 50-% area reduction and 48-% power saving for
applications,” in Proc. IEEE Int. Solid-State Circuits Conf. (ISSCC), matrix-vector multiplication of analog neuromorphic computing,” J.
2020, pp. 242–244. Semicond. Technol. Sci., vol. 14, no. 3, pp. 356–363, 2014.
This work was supported in part by the National Key Research and Development Program of China under Grant 2022YFB3604400; in part by the Suzhou
Science and Technology Program under Grant SYG202131; in part by the Key Program Special Fund in XJTLU under Grant KSF-T-07;
and in part by the XJTLU Research Development Funding under Grant RDF-20-02-43.
ABSTRACT In this work, dual-gate enhancement-mode (E-mode) device based NAND circuit (DG-NAND)
and the NAND block with double E-mode devices (DD-NAND) are developed and fabricated based on the
GaN MIS-HEMTs (metal-insulator-semiconductor-high-electron-mobility-transistors) platform. The DG-
NAND circuit has an area of 0.118 mm2 with the probe pad, which is 24% lower than the area of the
DD-NAND circuit. Both static and dynamic experimental results validate the advantages of performance
improvement of NAND circuits designed by dual-gate technology at an input voltage of 9 V. This paper
demonstrates the design potential of dual-gate NAND in an all-GaN MIS-HEMTs platform through
compact design.
INDEX TERMS Logic circuits, GaN, MIS-HEMTs, dual-gate E-mode device, monolithic integration.
fewer devices and the die area drops by 24%. Indicated par-
asitic parameters are also reduced due to the elimination of
redundant ohmic contacts. The static characterization of the
dual-gate E-mode device NAND circuit indicates a lower
VOL about 1.46 V. From the dynamic measurement results,
the DG-NAND logic circuit observed less falling time, which
decreased by 15%. Furthermore, the dual-gate NAND will
be applied as a subunit of the drive unit development for
compact design and functional integration of the all-GaN
MIS-HEMTs IC platform.
REFERENCES
FIGURE 6. Measured output voltage waveforms of the DD-NAND (green [1] D. Kinzer and S. Oliver, “Monolithic HV GaN power ICs:
line) and DG-NAND (blue line) logic circuits. Performance and application,” IEEE Power Electron. Mag., vol. 3,
no. 3, pp. 14–21, Sep. 2016, doi: 10.1109/MPEL.2016.2585474.
[2] R. Sun, Y. C. Liang, Y.-C. Yeo, C. Zhao, W. Chen,
TABLE 1. Comparison for GaN-based dual-gate device. and B. Zhang, “All-GaN power integration: Devices to func-
tional subcircuits and converter ICs,” IEEE J. Emerg. Sel.
Topics Power Electron., vol. 8, no. 1, pp. 31–41, Mar. 2020,
doi: 10.1109/JESTPE.2019.2946418.
[3] K. J. Chen et al., “GaN-on-Si power technology: Devices and appli-
cations,” IEEE Trans. Electron Devices, vol. 64, no. 3, pp. 779–795,
Mar. 2017, doi: 10.1109/TED.2017.2657579.
[4] C.-L. Tsai et al., “Smart GaN platform: Performance & challenges,”
in Proc. IEEE Int. Electron Devices Meeting (IEDM), 2017, pp. 1–4,
doi: 10.1109/IEDM.2017.8268488.
[5] S. Moench et al., “Monolithic integrated quasi-normally-off gate
driver and 600 V GaN-on-Si HEMT,” in Proc. IEEE 3rd Workshop
Wide Bandgap Power Devices Appl. (WiPDA), 2015, pp. 92–97,
doi: 10.1109/WiPDA.2015.7369264.
[6] D. Kinzer, “GaN power IC technology: Past, present, and future,” in
Proc. 29th Int. Symp. Power Semicond. Devices IC’s (ISPSD), 2017,
pp. 19–24, doi: 10.23919/ISPSD.2017.7988981.
[7] J. S. Moon et al., “<70% power-added-efficiency dual-
gate, cascode GaN HEMTs without harmonic tuning,” IEEE
Electron Device Lett., vol. 37, no. 3, pp. 272–275, Mar. 2016,
doi: 10.1109/LED.2016.2520488.
[8] S. Hazra, A. A. Shuvo, and A. G. Bhuiyan, “Electrical characteristics
of dual gate algan/gan high-electron mobility transistors,” in Proc.
Table 1 compares the dual gate technology for differ- 5th Int. Conf. Electr. Inf. Commun. Technol. (EICT), 2021, pp. 1–4,
ent applications. References [8], [10], and [12] apply a doi: 10.1109/EICT54103.2021.9733454.
[9] B. Kim, H. Q. Tserng, and P. Saunier, “A GaAs dual gate
second gate in the D-mode to improve device performance. power FET for operation up to K band,” in IEEE Int.
Papers [18] and [30] uses a second gate to complete Solid-State Circuits Conf. Dig. Tech. Papers, 1983, pp. 200–201,
the design of a bidirectional device. Both papers [20] doi: 10.1109/ISSCC.1983.1156508.
[10] L. Shi, M. Liu, N. Dong, and X. Lin, “Dual-metal-gate
and [21] utilize a dual-gate structure for the functional block AlGaN/GaN HEMTs for power application,” in Proc. 14th IEEE Int.
of the monolithic integrated GaN platform. Verifying the Conf. Solid-State Integr. Circuit Technol. (ICSICT), 2018, pp. 1–3,
dynamic performance and comparing it with the conventional doi: 10.1109/ICSICT.2018.8565730.
[11] D.-J. Lin, J.-Y. Yang, C.-K. Chang, and J.-J. Huang, “Study
design is necessary. In this paper, a logic circuit module of current collapse behaviors of dual-gate AlGaN/GaN HEMTs
for the GaN MIS-HEMTs platform is designed based on on Si,” IEEE J. Electron Devices Soc., vol. 10, pp. 59–64, 2022,
DG-E device and compared with the conventional design doi: 10.1109/JEDS.2021.3132429.
[12] R. Gong et al., “AlGaN/GaN dual gate MOS HFET for
of the DD-E structure through dynamic and static tests. power device applications,” in Proc. 10th IEEE Int. Conf.
Furthermore, this design also exploits the advantages of Solid-State Integr. Circuit Technol., 2010, pp. 1353–1355,
the MIS-HEMTs platform. The large gate swing lowers the doi: 10.1109/ICSICT.2010.5667654.
[13] M. Cui et al., “Monolithic integration design of GaN-based power
demand for gate voltage protection, resulting in a compact chip including gate driver for high-temperature DC–DC convert-
design. ers,” Jpn. J. Appl. Phys., vol. 58, no. 5, 2019, Art. no. 56505,
doi: 10.7567/1347-4065/ab1313.
[14] A. Li et al., “A monolithically integrated 2-transistor voltage reference
with a wide temperature range based on AlGaN/GaN technology,”
III. CONCLUSION IEEE Electron Device Lett., vol. 43, no. 3, pp. 362–365, Mar. 2022,
In this work, two types of AlGaN/GaN -based NAND logic doi: 10.1109/LED.2022.3146263.
circuits are demonstrated. Static and dynamic measurements [15] Y. Cai et al., “Low ON-state resistance normally-OFF AlGaN/GaN
MIS-HEMTs with partially recessed gate and ZrOx charge trapping
experimentally validate the NAND logic circuit of both struc- layer,” IEEE Trans. Electron Devices, vol. 68, no. 9, pp. 4310–4316,
tures. The dual-gate E-mode device NAND logic cell utilizes Sep. 2021, doi: 10.1109/TED.2021.3100002.
[16] H. Amano et al., “The 2018 GaN power electronics roadmap,” J. [24] W. Saito, Y. Takada, M. Kuraguchi, K. Tsuda, and I. Omura,
Phys. D, Appl. Phys., vol. 51, no. 16, Mar. 2018, Art. no. 163001, “Recessed-gate structure approach toward normally off high-Voltage
doi: 10.1088/1361-6463/aaaf9d. AlGaN/GaN HEMT for power electronics applications,” IEEE
[17] M. Giandalia, J. Zhang, and T. Ribarich, “650 V AllGaNTM power Trans. Electron Devices, vol. 53, no. 2, pp. 356–362, Feb. 2006,
IC for power supply applications,” in Proc. IEEE 4th Workshop doi: 10.1109/TED.2005.862708.
Wide Bandgap Power Device Appl. (WiPDA), 2016, pp. 220–222, [25] R. Sun, Y. C. Liang, Y.-C. Yeo, and C. Zhao, “Au-
doi: 10.1109/WiPDA.2016.7799941. free AlGaN/GaN MIS-HEMTs with embedded current sens-
[18] M. Wolf, O. Hilt, and J. Würfl, “Gate control scheme of monolithically ing structure for power switching applications,” IEEE Trans.
integrated normally OFF bidirectional 600-V GaN HFETs,” IEEE Electron Devices, vol. 64, no. 8, pp. 3515–3518, Aug. 2017,
Trans. Electron Devices, vol. 65, no. 9, pp. 3878–3883, Sep. 2018, doi: 10.1109/TED.2017.2717934.
doi: 10.1109/TED.2018.2857848. [26] M. Cui et al., “Monolithic GaN half-bridge stages with
[19] Y. Zhu, M. Cui, A. Li, F. Li, H. Wen, and W. Liu, “Monolithic integrated gate drivers for high temperature DC–DC buck
DFF-NAND and DFF-NOR logic circuits based on GaN MIS- converters,” IEEE Access, vol. 7, pp. 184375–184384, 2019,
HEMT,” in Proc. Int. Conf. IC Des. Technol. (ICICDT), 2021, pp. 1–4, doi: 10.1109/ACCESS.2019.2958059.
doi: 10.1109/ICICDT51558.2021.9626401. [27] X. Li et al., “GaN-on-SOI: Monolithically integrated all-GaN ICs
[20] A. Chvála et al., “Device and circuit models of InAlN/GaN for power conversion,” in Proc. IEEE Int. Electron Devices Meeting
D- and dual-gate E-mode HEMTs for design and characterisa- (IEDM), 2019, pp. 1–4, doi: 10.1109/IEDM19573.2019.8993572.
tion of monolithic NAND logic cell,” in Proc. 13th Int. Conf. [28] Z. Hu, S. Zhou, R. He, and Q. Zhang, “A broadband voltage variable
Des. Technol. Integr. Syst. Nanoscale Era (DTIS), 2018, pp. 1–6, attenuator with high-power tolerance and compact size based on dual-
doi: 10.1109/DTIS.2018.8368565. gate GaN HEMTs,” IEEE Trans. Power Electron., vol. 38, no. 5,
[21] M. Basler et al., “Building blocks for GaN power inte- pp. 6108–6115, May 2023, doi: 10.1109/TPEL.2023.3242474.
gration,” IEEE Access, vol. 9, pp. 163122–163137, 2021, [29] H. Xu, G. Tang, J. Wei, and K. J. Chen, “Integrated high-speed over-
doi: 10.1109/ACCESS.2021.3132667. current protection circuit for GaN power transistors,” in Proc. 31st
[22] M. D. Feuer et al., “Direct-coupled FET logic circuits on InP,” Int. Symp. Power Semicond. Devices ICs (ISPSD), 2019, pp. 275–278,
IEEE Electron Device Lett., vol. 12, no. 3, pp. 98–100, Mar. 1991, doi: 10.1109/ISPSD.2019.8757685.
doi: 10.1109/55.75724. [30] T. Morita et al., “650 V 3.1 mcm2 GaN-based monolithic bidi-
[23] G. Tang et al., “Digital integrated circuits on an E-mode GaN rectional switch using normally-off gate injection transistor,” in
power HEMT platform,” IEEE Electron Device Lett., vol. 38, no. 9, Proc. IEEE Int. Electron Devices Meeting, 2007, pp. 865–868,
pp. 1282–1285, Sep. 2017, doi: 10.1109/LED.2017.2725908. doi: 10.1109/IEDM.2007.4419086.