Professional Documents
Culture Documents
Outline
Introduction
Multi-core & Network-on-Chip paradigm
Performance limitations of conventional planar NoCs
Some alternatives
Possibility of designing wireless NoC (WiNoC)
CNT-based antennas
Possible communication schemes
Advantages and challenges of WiNoCs
Summary and road ahead
Multi-core applications
Nokia Sparrow
Intel LARRABEE
The network-on-chip paradigm
Driven by
Increased levels of
integration
Complexity of large
SoCs
New designs
counting 100s of
embedded cores
Need for platform-
based design
methodologies
DSM constraints
(power, delay, time-
to-market, etc)
Decoupling of functionality from communication
Dedicated infrastructure for data transport
NoC features
NoC infrastructure
switch link
NoC limitations
Predominantly multi-hop communication
High Latency and energy dissipation
Use of Express Virtual Channels
Core 1
Core 2
- IP core
- NoC interface
- NoC switch
Lower Latency
and Energy
Dissipation
Three Dimensional
Integration
Optical Interconnects
Wireless/RF
Interconnects
Novel interconnect paradigms for Multicore designs
3D NoC
Brett S. Feero, Partha Pratim Pande, Networks-On-Chip in a Three Dimensional Environment:
A Performance Evaluation, IEEE Transactions on Computers (TC), vol.58, no. 1, pp. 32-45,
January 2009.
Stacking multiple active layers
Manufacturability
Mismatch between various
layers
Yield is currently quite low
Temperature concerns
Despite power advantages,
reduced footprint increases
power density
Photonic NoC
High bandwidth photonic links for high
payload transfers
Limitations on switch architecture
More than 4-port designs are complex
On-chip integration of photonic components
A. Shacham et al., Photonic Network-on-Chip for Future Generations of Chip Multi-
Processors, IEEE Transactions on Computers, Vol. 57, issue 9, pp. 1246-1260.
On-Chip RF/Wireless Interconnects
Replace long
distance wires
Use of waveguides
out of package or
IC structures like
parallel metal wires
Chang et al.
demonstrated
Transmission Line
based RF
interconnect for on
chip
communication
Not really wireless
RF NoC
Bank of high frequency
oscillators and filters
FDM
On-Chip transmission
line acting as data
freeways
Routing of long
transmission lines
without eliminating any
existing links
M. F. Chang et al. CMP Network-on-Chip Overlaid With Multi-Band RF-Interconnect, Proc. of
IEEE International Symposium on High-Performance Computer Architecture, 16-20 February,
2008, pp. 191-202.
Wireless Network-on-Chip (WiNoC)
! Among several options, some may be possible
without a revolutionary technology
! Use of on-chip wireless links
" High bandwidth
" Speed of light
" Long distance
! Reduce latency and energy dissipation in communication
between distant nodes
Early example of on-chip wireless
interconnects
First utilized for distribution of clock signal
Technology: 0.18 um CMOS
Operating frequency: 15 GHz
Single Tone
Modulation and Channelization is not of any concern
~ 2 mm
B. A. Floyd, H. Chih-Ming, and K. K. O, IEEE Journal of Solid-State Circuits, vol. 37, pp. 543-552, 2002.
Propagation mechanisms of radio waves
over
intra-chip channels
Characterization of on-chip radio communications
Monopole Antennas
Measurement is done for the frequency range of 10 100 GHz
Y. P. Zhang et al., Propagation Mechanisms of Radio Waves Over Intra-Chip Channels with Integrated Antennas:
Frequency-Domain Measurements and Time-Domain Analysis, IEEE Transactions on Antennas and Propagation, Vol.
55, No. 10, October 2007, pp. 2900-2906.
CNT antennas
To make the antennas small, we
need small wavelengths # light
(IR, visible, UV)
MWCNT as Optical Antennae
Directional radiation
characteristics are in an excellent
and quantitative agreement with
conventional radio antenna theory
and simulations
K. Kempa, et al., "Carbon Nanotubes as Optical Antennae," Advanced Materials, vol. 19, 2007, pp.
421-426
G. Y. Slepyan, et al. , "Theory of optical scattering by achiral carbon nanotubes and their potential as
optical nanoantennas," Physical Review B (Condensed Matter and Materials Physics), vol. 73, pp.
195416-11, 2006
CNT bundle dipole antennas
SWCNT bundle dipole antennas
The efficiency of a bundle antenna can be 3040 dB higher
than that of a single SWCNT dipole antenna
Y. Huang et al., Performance Prediction of Carbon Nanotube Bundle Dipole Antennas, IEEE
Transactions on Nanotechnology, Vol. 7, No. 3, May 2008, pp. 331-337
Why nanotubes for antenna
application?
Already made by nature! How else would we want to make such small
structures?
Ballistic transport and quantum conductance # low resistive loss
Smooth, defect-free, stable and chemically complete structure # no power
loss due to defects or edge and surface roughness
Structural strength and high conductivity # high current carrying capacity (10
9
A/cm
2
)
Light absorption and generation in
nanotubes
J. A. Misewich, R. Martel, P. Avouris, J. C. Tsang, S. Heinze, and J. Tersoff, Science, vol. 300, pp. 783-786, 2003.
M. Freitag, V. Perebeinos, J. Chen, A. Stein, J. C. Tsang, J. A. Misewich, R. Martel, and P. Avouris, Nano Letters, vol. 4, pp.
1063-1066, 2004.
The CNT is expected to be a linearly polarized dipole radiation source
Conceptual transmitter and receiver
structures
About 10 different
frequency channels are
available.
There is a strong
polarization dependence.
Modulation and demodulation are performed by the antenna itself!
Courtesy: Alireza Nojeh, University of British Columbia
Hybrid Wired/Wireless NoC (WiNoC)
On-chip wireless nodes have
associated overhead
Hybrid architecture
Divide the whole NoC into multiple
subnets
Communication within the subnets is
still through traditional wires
Utilize wireless links for inter-subnet
data exchange
Each subnet will have a wireless base
station (WB)
Subnet architectures may vary and
even be heterogeneous on the same
chip
Network optimization
Limited Wireless Resources
Wireless part of the network should be simple
Position of the WB within the subnet is important
Connectivity in the wireless part of the network
Avoid multi-hop communication in the wireless channels
" Take advantage of speed of light data transfer
Point-to-point wireless links
Adopt small-world network features
" Enable easy scalability for larger system sizes
Minimize the overhead
Avoid complicated MAC protocols
Connecting the subnets
regular lattice
small-world
random graph
Courtesy: Christof Teuscher, Portland State University
Small-World Nets: The Watts-Strogatz Model
Establish high speed long distance links among
distant blocks on the chip
Communication mechanisms with CNT
antennas
Use multiband lasers to excite the antennas
Electroluminescence phenomenon will eliminate this overhead
Laser sources of different frequencies
Establishes a form of FDM
Optical modulators/demodulators
Different frequency channels can be assigned to pairs of
communicating subnets
Antenna elements tuned to different frequencies for each pair
WiNoC with small-world connections
Wireless port
Overall channelization scheme
32-bit flit width
4 distinct frequency
channels
Combination of FDM
and TDM.
Simple on-off keying
Establishing wireless links
Throughput and Latency
WiNoC is capable of improving performance of wireline architectures
Throughput
Latency
Scaling trend
Throughput degrades more if the subnet size is increased rather than
increasing the number of subnets
Summary
WiNoCs are promising alternatives to conventional planar
on-chip networks
Capable of improving NoC performance significantly
CNTs demonstrate interesting optical antenna properties
WiNoCs designed with CNT antennas will have low
overhead.
Road ahead
Overall network design
Development of scalable wireless network
Network optimization
Partitioning of wireless and wired network
Reliability of the wireless channel
Novel ECC schemes
CNT antennas are promising
But some unknowns!
Explore the possibility of NoC with mm-wave
wireless links
Acknowledgements
Dr. Benjamin Belzer, WSU
Dr. Deuk Heo, WSU
Dr. Christof Teuscher, PSU
Dr. Alireza Nojeh, UBC
Mr. Amlan Ganguly
Mr. Kevin Chang, WSU
Mr. Sujay Deb, WSU
Survey of Wireless Network-on-Chip Systems
by
Xi Li
A report submitted to the Graduate Faculty of
Auburn University
in partial fulllment of the
requirements for the Degree of
Master of Electrical Engineering
Auburn, Alabama
May 10, 2012
Keywords: wireless, NoC
Copyright 2012 by Xi Li
Approved by
Vishwani Agrawal, Chair, James J. Danaher Professor of Electrical and Computer
Engineering
Shiwen Mao, Associate Professor of Electrical and Computer Engineering
Jitendra Tugnait, James B. Davis and Alumni Professor of Electrical and Computer
Engineering
Mark Nelms, Professor and Chair of Electrical and Computer Engineering
Abstract
Nowadays, network-on-chip (NoC) systems are becoming more popular due to their big
advantages when compare with systems-on-chip (SoC). Therefore, an increasing number of
researchers and organizations now focus on the study and development of NoC techniques.
As a result, so far many achievements have been gained. Furthermore, considering the
dominant position of wireless and the weakness of wired communication, people also turn to
try to insert wireless links in NoC systems in order to solve the multi-hop problem.
This report gives a brief description of some outstanding developments of NoC and
WNoC (Wireless NoC) systems, including some important technique and the achieved re-
sults, mainly related to the required hardware and communication protocols. In addition,
the report also contains my experiments on NoC and WNoC systems. I use the Booksim
simulator to measure their performances and make some comparisons, and then give some
analysis and conclusion of those results. At the end the report summarizes the nished work
and gives some more developing directions of NoC and WNoC systems.
ii
Acknowledgments
I would like to express my gratitude to people who helped me in this project.
The rst person I must thanks is my advisor Dr. Vishwani D. Agrawal, the James J.
Danaher Professor at Electrical Engineering Department. During the process of the project,
he lent me so much help. When I met problems, he would guide me to solve those problems,
either through emails or in face to face meetings, he always helped me with lot of patience.
Dr. Agrawals kindness and patience made me become interested in this project and provided
me much energy to do my work and also gave me a hope when I felt frustrated.
I am also deeply grateful to my committee members, Dr. Shiwen Mao and Dr. Jitendra
Tugnait for their great teaching and patience during my study.
Another two people I must thanks to are Dr. Alireza Babaei at Auburn and Dr. Partha
Pratim Pande from Washington State University. Dr. Babaei helped throughout the project.
Dr. Pande oered me much useful references. From his help and references, I gained a
signicant amount information that I needed.
I greatly appreciate my colleague and friend, Suraj Sindia, for his help about how to
use latex easily.
Last but not least, I must thank my parents. Though they are no experts in this area,
they encourage me so mush that when I meet diculties, no matter how big they are, I never
lose hope and at last I can overcome them.
Many thanks to my dearest friends in Auburn. Without them, my life in Auburn could
not have been so wonderful.
iii
Table of Contents
Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ii
Acknowledgments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . iii
List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vi
List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii
1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2 Problems in Traditional NoC . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
3 Some Solutions and Motivation for Studying Wireless NoC . . . . . . . . . . . . 6
3.1 Ultra Wide Band (UWB) Based WNoC System . . . . . . . . . . . . . . . . 7
3.2 A mm-Wave WNoC System . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.3 CNT Based WNoC System . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.4 Other Needed Devices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
4 Technique Used in a WNoC System . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.1 Structure and Topology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4.1.1 Structure of a Pure Wireless NoC System . . . . . . . . . . . . . . . 12
4.1.2 Structure of Hybrid Wireless NoC System . . . . . . . . . . . . . . . 13
4.2 Wireless Link Insertion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
4.3 Routing and Communication Protocol . . . . . . . . . . . . . . . . . . . . . 14
4.3.1 Protocols in Pure Wireless NoC System . . . . . . . . . . . . . . . . . 15
4.3.2 Protocols in a Hybrid Wireless NoC System . . . . . . . . . . . . . . 17
5 A Simulator for NoC System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
6 Factors and Parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
7 Experimental Processes, Results and Analysis . . . . . . . . . . . . . . . . . . . 25
7.1 Experimental Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
iv
7.2 Experimental Result and Analysis . . . . . . . . . . . . . . . . . . . . . . . . 28
7.2.1 Latency vs. Injection Rate . . . . . . . . . . . . . . . . . . . . . . . . 28
7.2.2 Latency vs. Virtual Channels . . . . . . . . . . . . . . . . . . . . . . 36
7.2.3 An Analysis of WNoC System . . . . . . . . . . . . . . . . . . . . . . 38
8 Conclusion and Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
8.1 Work Completed in This Report . . . . . . . . . . . . . . . . . . . . . . . . . 40
8.1.1 Device-Level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
8.1.2 System-Level . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41
8.1.3 Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
8.2 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
v
List of Figures
7.1 Example conguration le for simulating a mesh NoC system. . . . . . . . . . . 26
7.2 Simulator output from running the examples/mesh88 lat conguration le. . . . 27
7.3 A 4 4 mesh topology. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
7.4 Latency vs. injection rate in mesh topology. . . . . . . . . . . . . . . . . . . . . 29
7.5 A 4 4 mesh topology with concentration 4. . . . . . . . . . . . . . . . . . . . . 30
7.6 Average hops in 8 8 mesh. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 31
7.7 Average hops in 8 8 mesh with concentration 4. . . . . . . . . . . . . . . . . . 31
7.8 Latency vs. injection rate in cmesh topology. . . . . . . . . . . . . . . . . . . . 32
7.9 Comparison between regular mesh and concentrated mesh. . . . . . . . . . . . . 33
7.10 (a) Flattened buttery topology consisting of 64 nodes; (b) corresponding router
layout. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
7.11 Latency vs. injection rate in attened buttery topology consisting of (a) 64
nodes and (b) 256 nodes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
7.12 Simulated latency vs. number of virtual channels for (a) regular mesh and (b)
attened buttery topologies. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
7.13 Comparison between regular mesh and attened buttery with (a) 4 virtual chan-
nels, (b) 8 virtual channels and (c) 16 virtual channels. . . . . . . . . . . . . . . 39
vi
List of Tables
3.1 WNoC system parameters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
vii
Chapter 1
Background
Many publications [4], [28], [55] state that the emergence of SoC (System-on-Chip)
platforms consisting of a larger number of embedded processors is a outstanding solution
for some industrial tasks. As described in [23], [76], an SoC is a type of micro system that
integrates many components like processor cores, DSP cores, memories (or storage of control
interface that may be out of the chip) and many other hardware cores, which otherwise
perform specialized tasks on separate single dies. However, according to [23], [65], [76], there
are two limitations of SoC: rst of all, the communication among the Intellectual Property
(IP) blocks impedes the development at the system level, aggravating the complexity issue.
The global wire delays are the most important factors that typically do not scale with
technology scaling even after we insert repeaters [29]. Another problem with SoC is that, SoC
always integrates several dierent hardware cores for dierent applications on the same chip,
but those applications always have diverse requirements such as the dierent communication
standards and specic design constraints. Therefore, there may be diculties when designing
a common SoC for many applications.
In order to solve the problems stated above, NoC (Network-on-chip) is a good paradigm
[3], [4], [23], [64], [76]. NoC is an integrated network that uses routers to allow the commu-
nication among those blocks. It makes use of networking theory and methods for on-chip
communication so that the blocks can exchange information on a chip just like what the
terminals do in the actual world. The so called blocks here will always refer to processors
and caches and, for simplify, a block will be renamed as node in the remaining report.
In a system, the distribution of nodes complies with certain specic topologies. Some
popular topologies have been studied, such as SPIN (Scalable, Programmable, Integrated
1
Network), CLICH (Chip-Level Integration of Communicating Heterogeneous Elements), reg-
ular mesh, Torus, Fold torus, Octagon and BFT (Buttery Fat Tree) [26], [65]. Each node
communicates through routers and since there must be a numbers of routers integrated on
a centimeter-size chip, the size of router should be small enough and the structure should
be as simple as possible. In general, a router in NoC system consists of 1) buers to store
data temporarily, 2) arbiters to decide the sequence of data transmission and 3) switches to
transfer data in the right direction. So far, the wormhole router is the most popular router
used in NoC systems. The advantage of this type of router is that data can be transmitted
more uently. Corresponding to the wormhole router, the transmitted data unit in NoC
is usually called it. A it contains dozens of bits. Each it has a header, which con-
tains the address of the destination, and it is followed by a string of data bits. When being
transmitted, the it runs like a stream through the routers.
A link in NoC is point to point. The common communication between two nodes
is generally based on packet-switching, although there exist other NoC proposals utilizing
circuit-switching techniques. As each channel that connects nodes is duplex and shared by
only two nodes, the NoC improves the performance over the traditional system, which uses
shared-bus to transmit data.
In a NoC system, time latency, power consumption and throughput are the main pa-
rameters used to evaluate its performance. Latency is dened as the time (in clock cycle)
that elapses between the occurrence of a message header injection into the network at the
source node and the reception of the tail of message at the destination node [63]. Therefore,
for a given message, the latency L is given by [65]:
L = sender overhead + transport latency + receiver overhead (1.1)
The transport latency is tiny so that we can ignore it. Send overhead and receiver
overhead mainly depend on the message waiting time in a router, which is in proportion
to the number of nodes on a single chip times the injection rate. For example, if a node
2
requires to transmit data to another node, the data must pass through all routers along the
path between those two nodes. Although a path can be readily determined by any kinds of
routing protocol, the transmitted data must encounter every router on the path. Assuming
a standard mesh network, if node A, positioned is at (1, 1), intends to transmit data to node
B at position (N, M), where (N > 2, M > 2), with some high probability there can be other
transmitting paths that cut across the long path between nodes A and B. That means the
routers in the path from node A to node B will be used by other transmissions at the same
time. Since a router can process only one transmission at a time, when several transmissions
come to the same router, only one of them can be processed immediately while the others
wait until the router is free. In such cases, a high latency in the NoC system is encountered.
Power consumption is another parameter that aects the performance of an NoC system.
Since there are many node-to-node hops in a path and each hop consumes some energy, the
multi-hop must cost more total energy. For each hop, router is the main source of the power
consumption. When a it passes by a router, it will be stored in an input buer to wait
for being processed. If there are more than one its stored in dierent input-buers, an
arbiter should decode the destination information of each it and decide which it to send
rst according to the routing protocol being used. Then, the chosen it is transmitted from
input-buer to output-buer through a crossbars switch. If there is only one it arrives at
the output of the router, it can be sent directly. But if there are several its they have to
be stored in the output buer temporally until their turn. The power dissipated in a NoC
can be calculated as follows [44]:
P =