Professional Documents
Culture Documents
net/publication/271589727
CITATIONS READS
3 187
5 authors, including:
Aida Todri-Sanial
French National Centre for Scientific Research
150 PUBLICATIONS 515 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
SmartVista: Smart Autonomous Multi Modal Sensors for Vital Signs Monitoring View project
All content following this page was uploaded by Yuanqing Cheng on 01 February 2015.
Xiaolong Zhang∗ , Yuanqing Cheng∗ , Weisheng Zhao∗† , Youguang Zhang∗ and Aida Todri-Sanial‡
∗ School of Electronic and Information Engineering, Beihang University, Beijing, China 100191
† IEF, Université Paris-Sud / CNRS, Orsay, 91405, France
‡ LIRMM – Université Montpellier 2 / CNRS, Montpellier, 34095, France
4 5 7 0
Parameters already defined in PMA MTJ compact model 4 0 S T T -R A M 6 0
3 5 S R A M
Symbol Definition Value 5 0
3 0
)
)
2
φ Potential barrier height of MgO 0.4
2
4 0
A r e a (m m
2 5
A r e a (m m
Hk Anisotropy field 113.0 × 103 A/m 2 0 3 0
W r ite E n e r g y (n J p e r a c c e s s )
S T T -R A M 1 .6 S R A M
4
S R A M 4
W r ite L a te n c y (n s )
W r ite L a te n c y (n s )
1 .4
1 .0
1 .2
3
1 .0 3
0 .8
2 2
0 .5 0 .6
0 .4
1 1
0 .2
0 .0 0 .0
C a c h e 0 C a c h e 0
5 1 2 K B 1 M B 2 M B 4 M B 8 M B 1 6 M B 3 2 M B 5 1 2 K B 1 M B 2 M B 4 M B 8 M B 1 6 M B 3 2 M B
C a p a c ity 5 1 2 K B 1 M B 2 M B 4 M B 8 M B 1 6 M B 3 2 M B C a p a c ity 5 1 2 K B 1 M B 2 M B 4 M B 8 M B 1 6 M B 3 2 M B
(a ) (b )
(a ) (b )
Fig. 3. Write dynamic power comparisons between PMA STT-MRAM and Fig. 6. Write latency comparisons between PMA STT-MRAM and SRAM
SRAM under (a) 32nm (b) 40nm technology nodes respectively under (a) 32nm (b) 40nm technology nodes respectively
6 0 0 0 0 5 0 0 0 0
5 0 0 0 0
thereby will not impact cache performance significantly.
L e a k a g e P o w e r (m W )
L e a k a g e P o w e r (m W )
S T T -R A M 4 0 0 0 0
4 0 0 0 0 S R A M
3 0 0 0 0
3 0 0 0 0 In summary, although PMA STT-MRAM has relative larger
2 0 0 0 0
2 0 0 0 0 write latency, it outperforms SRAM in other metrics such as
1 0 0 0 0
1 0 0 0 0
area overhead, power consumption and read latency. Therefore,
C a c h e
0
5 1 2 K B 1 M B 2 M B 4 M B 8 M B 1 6 M B 3 2 M B C a p a c ity
0
5 1 2 K B 1 M B 2 M B 4 M B 8 M B 1 6 M B 3 2 M B it is advantageous to use PMA STT-MRAM for future on-chip
(a ) (b ) cache design.
Fig. 4. Leakage power comparisons between PMA STT-MRAM and SRAM
under (a) 32nm (b) 40nm technology nodes respectively V. C ONCLUSIONS
As integration density sharply increases with technology
node shrinking, CMOS technology suffers from severe pow-
of SRAM. In contrast, PMA STT-MRAM shows a significant
er consumption. To overcome this problem, some emerging
advantage in term of write power as shown in Fig.3. Although
technologies are proposed including PCRAM, ReRAM and
the power for writing a single STT-MRAM cell is higher
STT-MRAM. STT-MRAM is a competitive candidate to be
than that for a single SRAM cell, PMA STT-MRAM has
used for cache design. However, existing research focuses on
much smaller area thus much lower H-Tree interconnect power.
in-plane STT-MRAM. When MTJ feature size scales down
Moveover, PMA STT-MRAM write latency for a single cell
to 40nm and below, perpendicular magnetic anisotropy effect
is comparable to SRAM. Therefore, PMA STT-SRAM shows
becomes dominant and requires new compact model as well
overall higher power efficiency than SRAM, and a desirable
as architecture level evaluations based on it. In this paper,
scalability with technology node shrinking. The static power
we evaluate benefits brought by PMA STT-MRAM compared
comparison is shown in Fig. 4. Thanks to non-volatile property,
to SRAM from power, performance and area perspectives
PMA STT-MRAM has negligible leakage power compared to
with the aid of a PMA STT-MRAM compact model. The
SRAM similar to its in-plane counterpart. As the technology
simulation results show that PMA STT-MRAM can achieve
node shrinks and capacity increases, the SRAM leakage power
better performance, lower power consumption and smaller area
rises dramatically compared to PMA STT-MRAM, which
overhead and pave the way for next generation cache design.
indicates huge power benefits of PMA STT-MRAM as on-chip
cache.
R EFERENCES
C. Access Latency [1] Claude Chappert et al., “The emergence of spin electronics in data
storage,” Nature Materials, vol. 6, no. 11, pp. 813–823, 2007.
The read latency comparisons at 32nm and 40nm tech- [2] S. Raoux et al., “Phase-change random access memory: A scalable
nology nodes are shown in Fig.5. It shows that SRAM read technology,” IBM Journal of Research and Development, vol. 52, no.
4.5, pp. 465–479, 2008.
access latency is smaller than PMA STT-MRAM when the
[3] J. Joshua Yang et al., “Memristive devices for computing,” Nature
cache capacity is small (e.g., smaller than 4MB). As the Nanotechnology, vol. 8, no. 1, pp. 13–24, 2012.
capacity increases, PMA STT-MRAM shows lower latency due
[4] W. Kang et al., “An overview of spin-based integrated circuits,” in Pro-
to the reducing H-Tree interconnect delay. The write latency ceedings of 19th Asia and South Pacific Design Automation Conference
of PMA STT-MRAM is larger than SRAM as shown in Fig. (ASP-DAC), Singapore, Jan. 20-24 2014, pp. 676–683.
6. However, write path usually doesn’t lie in the critical path [5] L. B. Faber et al., “Dynamic compact model of spin-transfer torque
based magnetic tunnel junction (MTJ),” in Proceedings of IEEE 4th
International Conference on Design & Technology of Integrated Systems
1 0
6 in Nanoscal Era, Cairo, 6-9 April 2009, pp. 130–135.
8 5 [6] G. Sun et al., “A novel architecture of the 3d stacked mram l2 cache
S T T -R A M
R e a d L a te n c y (n s )
R e a d L a te n c y (n s )
3
Performance Computer Architecture, Raleigh, USA, 2009, pp. 239–249.
4
2
[7] Y. Zhang et al., “Compact modeling of perpendicular-anisotropy
2 CoFeB/MgO magnetic tunnel junctions,” IEEE Transactions on Electron
1
Devices, vol. 59, no. 3, pp. 819–826, 2012.
0 C a c h e 0
5 1 2 K B 1 M B 2 M B 4 M B
(a )
8 M B 1 6 M B 3 2 M B C a p a c ity 5 1 2 K B 1 M B 2 M B 4 M B 8 M B 1 6 M B 3 2 M B [8] X. Dong et al., “Nvsim: A circuit-level performance, energy, and
(b )
area model for emerging nonvolatile memory,” IEEE Transactions on
Computer-Aided Design of Integrated Circuits and Systems, vol. 31,
Fig. 5. Read latency comparisons between PMA STT-MRAM and SRAM no. 7, pp. 994–1007, 2012.
under (a) 32nm (b) 40nm technology nodes respectively