Professional Documents
Culture Documents
Path Setup For Hybrd NoC Arch Explt Flood Stndby
Path Setup For Hybrd NoC Arch Explt Flood Stndby
Abstract—Future many-core systems will require energy-efficient, high-throughput and low-latency communication architectures.
Silicon Photonics appears today a promising solution towards these goals. The inability of photonics networks to perform inflight
buffering and logic computation suggests the use of hybrid photonic-electronic architectures. In order to exploit the full potential of
photonics, it is essential to carefully design the path-setup architecture, which is a primary source of performance degradation and
power consumption. In this paper, we propose a new path-setup approach which can put allocated circuits in a stand-by state, rapidly
restoring them when needed. Path-setup messages are sent using a flooding routing strategy to enhance the possibility of finding free
optical paths. We compare the proposed approach with a commonly used path-setup strategy as well as some other alternatives
available. The results exhibit encouraging improvements in terms of both performance and energy consumption.
Index Terms—Silicon photonics, path-setup, hybrid photonic-electronic networks-on-chip, parallel architectures, on-chip interconnection
networks
Ç
1 INTRODUCTION
how long to allocate the resources under given power show that this length depends on the process technology scal-
constraints. ing, e.g., in case of a 22 nm process, it is equal to one-tenth of
To fully exploit the benefits inherent in photonics, one the chip edge length. A comparison of different emerging
needs to design the control network as well as the path- technologies for on-chip interconnects is carried out in [11].
setup protocol so as to minimize the path-setup overhead According to the authors, silicon photonics exhibits the low-
while maximizing the number of concurrent optical paths est latency and the highest possible bandwidth in case of
and reducing the probability of conflicts. In this paper, we wavelength division multiplexing. In [12] the authors outline
propose a new path-setup protocol exploiting a flooding the opportunities and challenges of different emerging inter-
routing strategy and the possibility of putting allocated cir- connect technologies, including nanophotonic communica-
cuits in a stand-by state. Compared to traditional path-setup tion. They evaluate the possibility of combining three-
protocols, the main advantages are listed below: dimensional (3D) stacking technology and silicon photonics
to exploit the advantages of both technologies.
The flooding routing strategy provides high adap-
In the recent years, several research efforts have focused
tiveness in routing enhancing the possibility of find-
on proposing various solutions at the architectural level.
ing free paths. In this way, we can reduce the
[13] presents a high-bandwidth multiple-wavelength trans-
number of conflicts that increase the latency and
mission scheme, which essentially exploits the ability to
energy consumption.
transmit parallel wavelength channels through a single
By putting allocated circuits in a stand-by state, the
microring resonator. Shacham et al. [5] proposes a hybrid
number of path-setup attempts are reduced. In this
torus-based NoC made up of a photonic circuit-switched
way, we can allocate an optical path without per- and an electronic packet-switched network. In [14], the
forming an entire path-setup phase, thus limiting the Corona Multiple-Writer Single-Reader (MWSR) photonic
flooding of control messages in the network. This ring is presented. Pan et al. [15] describes Firefly, a clustered
will further improve the energy efficiency. architecture locally connected via an electronic medium and
We compare the proposed approach with a commonly used globally connected via an optical crossbar, partitioned into
path-setup algorithm and some available alternatives. The multiple logical crossbars arbitrated locally. Mo et al.
results exhibit encouraging improvements in terms of both [16] proposes a hybrid optical-electronic mesh-based NoC
performance and energy consumption. exploiting hybrid optical-electronic routers able to perform
A few preliminary results of this work were reported in a selective transmission. In [17], the THOE architecture is pro-
previous conference publication [6]. In this paper, we pres- posed, a low cost torus-based hybrid optical-electronic NoC
ent the full methodology and we consider a more detailed that can be implemented with a lower number of waveguides
architecture description, improved optical loss and power and ring resonators compared to [5]. Bahirat and Pasricha
estimation models, and a comprehensive experimental eval- [18] presents METEOR, a complete communication architec-
uation involving both synthetic traffic and real applications. ture consisting of a reconfigurable electrical mesh coupled
In addition, a formal proof of the deadlock freedom of the with an optical ring whose waveguides can be configured as a
proposed approach is presented. The paper is organized as combination of Single-Writer Multiple-Readers (SWMR) and
follows: Section 2 discusses some related works in the tech- Multiple-Writers Multiple-Readers (MWMR). Beux et al.
nical literature. Section 3 introduces the target architecture [19] introduced CHAMELEON, a Single-Writer Single-
and presents the adopted optical loss and energy models. Reader optical network-on-chip able to exploit reconfigurabil-
Section 4 thoroughly describes the path-setup protocol, the ity both at run- or compile-time in order to open and close
routing algorithm, and the deadlock avoidance policy. dedicated channels between cores. In [20], a hybrid elec-
Section 5 presents the experimental setup and the results tronic/photonic, hybrid-topology NoC, called H2 ONoC, is
achieved. Section 6 provides some final remarks. presented. Thanks to the novel topology, H2 ONoC achieves a
better energy efficiency compared to torus-based architec-
tures. To reduce the energy consumption and achieve a higher
2 RELATED WORK bandwidth, [21] presents a topology comparison between dif-
2.1 Photonic Networks-on-Chip ferent architectures and introduces an improved topology for
While optical interconnects are widely used to provide com- the solution presented in [5]. Finally, [22] presents PhoNoC-
munication facilities over long and medium distances [7], the Map, a computer-aided design tool addressing the design
recent advancements in nanophotonic technologies also space exploration of optical NoC mapping solutions. Pho-
make silicon photonics a promising approach for on-chip NoCMap automatically assigns application tasks to the nodes
communication. Several works in the recent years have ana- of a generic photonic NoC architecture such that either the
lyzed the benefits and limits of optics versus electronics. worst-case insertion loss or crosstalk noise are minimized.
In [8], a comparison between electrical wires and optical inter-
connects in terms of delay and power is presented. The results 2.2 Path-Setup
show that optics is advantageous for global signaling, while A few electronic NoC architectures employ circuit switch-
electrical wires are more power-effective for shorter link ing in order to provide guaranteed service levels. A repre-
lengths. Based on the semiconductor technology roadmap, sentative example is the ÆTHEREAL [23] NoC, developed
[9] investigates the requirements that photonic interconnects at Philips, that provides guaranteed throughput alongside
should meet in order to outperform traditional electronic best-effort service. A basic version of the path-setup proto-
interconnects. In [10] a study of the critical length beyond col relies on four types of control messages: path-setup, path-
which optical interconnect pays off is presented. The results ack, path-nack, and path-teardown. Basically, a path-setup
Authorized licensed use limited to: Georgia Institute of Technology. Downloaded on September 03,2023 at 07:30:41 UTC from IEEE Xplore. Restrictions apply.
FUSELLA ET AL.: PATH SETUP FOR HYBRID NOC ARCHITECTURES EXPLOITING FLOODING AND STANDBY 1405
TABLE 2 TABLE 3
Power Consumption for Electronic Power Consumption for Photonic
Signaling Signaling
reading from the buffer; source to generate just enough power to ensure a proper
taking routing and arbitration decisions; detection of the optical signal at the photodetectors. As a con-
crossing the inner switch; sequence, when generating a new optical signal, we need to
going through the link; evaluate the power loss the signal is subject to when travel-
writing to a buffer. ing to the destination. In this way, we can avoid considering
As a consequence, the energy consumed by sending a mes- the worst-case insertion loss for every communication.
sage end-to-end is equal to the sum of these energy values Table 3 reports the power consumption values for these
times the number of hops: operations including the VCSEL power consumption for
four possible loss values assuming 30 percent laser efficiency
Emessage ¼ Ehop Nhops ; (4)
and 14:2 dBm sensitive receivers. Notice that our approach
where Nhops is the number of hops taken by a message and is independent of the used power consumption values and
Ehop is the energy necessary to cross a single hop: optical loss coefficients, which are only indicative and may
be influenced by technology and architectural variables.
Ehop ¼ ðElink dlink Þ þ Ebuffer þ Ecrossbar þ Estatic ; (5)
TABLE 4 TABLE 5
Simulation Parameters Characteristics of the Architectures Analyzed
Fig. 5. A comparison between the different path-setup architectures in terms of latency under synthetic traffic.
Fig. 7 shows the average latency and energy consump- probability of finding free paths. A better reduction is pro-
tion for each path-setup approach when implementing the vided by Arch-A and Arch-B, able to reach a reduction of
different applications in a 8 8 mesh topology. As in the more than 40 percent compared to the baseline solution.
previous section, the figure displays the breakdown of the Notice that the two virtual channels owned by Arch-B give
latency and energy consumption in terms of path-setup a slight advantage over Arch-A. Last, the proposed
overhead as well as electronic and optical transmission. In approach achieves a 45 percent improvement over the base-
general, the traffic pattern is less challenging than the line solution while, compared to Arch-A and Arch-B, it
worst-case synthetic traffic and hence the network never incurs a negligible 3 percent penalty in terms of latency
saturates. caused by the protocol with standby.
Concerning the latency, the baseline architecture per- Differently, concerning the energy consumption, Arch-C
forms worse than the other architectures. Arch-C performs performs poorly due to the non-minimal behaviour of its
better than the baseline solution, achieving an average flooding routing algorithm: the energy consumption is
reduction of around 20 percent. This is due the flooding around the 50 percent higher than the baseline architecture.
algorithm with non-minimal paths which increases the Arch-A consumes an average 30 percent less energy than
Authorized licensed use limited to: Georgia Institute of Technology. Downloaded on September 03,2023 at 07:30:41 UTC from IEEE Xplore. Restrictions apply.
1414 IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 28, NO. 5, MAY 2017
Fig. 6. A comparison between the different path-setup architectures in terms of energy per bit consumption under synthetic traffic.
the baseline solution, while Arch-B consumes an average 12 architectures. This advantage roughly accounts for a 80 per-
percent less energy than Arch-A. Last, the proposed cent reduction compared to the baseline architecture. The
approach consumes less energy compared to all the other results above point out that the architecture exploiting the
path-setup protocol with standby is highly energy efficient,
TABLE 7 while the latency overhead is negligible.
Characteristics of the Applications Analyzed
6 CONCLUSION
Application #Tasks #Comm Avg msg Avg msg TDC This paper shown the importance of using a suitable
links size (Byte) injection rate
path-setup strategy to enable efficient use of the photonic
FPPPP 334 1,145 204.8 0.00064 0.17 resources in hybrid photonic-electronic networks. We
H264-720p_dec 2,311 13,461 1,280 0.000018 0.10
proposed a new path-setup protocol that can reduce the
H264-1080p_dec 5,191 7,781 1,280 0.000018 0.19
SPARSE 96 67 819.2 0.00031 0.02
path-setup latency and the power consumption due to
its ability to put allocated circuits on a stand-by state. In
Authorized licensed use limited to: Georgia Institute of Technology. Downloaded on September 03,2023 at 07:30:41 UTC from IEEE Xplore. Restrictions apply.
FUSELLA ET AL.: PATH SETUP FOR HYBRID NOC ARCHITECTURES EXPLOITING FLOODING AND STANDBY 1415
Authorized licensed use limited to: Georgia Institute of Technology. Downloaded on September 03,2023 at 07:30:41 UTC from IEEE Xplore. Restrictions apply.
1416 IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, VOL. 28, NO. 5, MAY 2017
[29] J. Poulton, et al., “A 14-mw 6.25-gb/s transceiver in 90-nm Edoardo Fusella received the BS, MS, and PhD
CMOS,” IEEE J. Solid-State Circuits, vol. 42, no. 12, pp. 2745–2757, degrees in computer engineering from the Uni-
Dec. 2007. versity of Naples Federico II, Italy, in 2008, 2011,
[30] H. Watanabe, T. Atsushi, K. Takayuki, I. Matsubara, and K. Iwata, and 2015, respectively, where he is currently a
“Vertical cavity surface emitting laser,” U.S. Patent App. 14/447 post-doctoral researcher in the Department of
528, Jul. 30, 2014. Electrical Engineering and Information Technolo-
[31] G. Masini, G. Capellini, J. Witzens, and C. Gunn, “A four-channel, gies. He has been a visiting researcher with the
10 Gbps monolithic optical receiver in 130nm CMOS with inte- cnica de Vale
Universitat Polite ncia, Spain and
grated ge waveguide photodetectors,” in Proc. Conf. Nat. Fiber NEC Europe Ltd., Heidelberg, Germany. His
Optic Eng. Conf., 2007, Art. no. PDP31. research interests focus on the domain of many-
[32] C. Kromer, et al., “A 100-mw 4 10 gb/s transceiver in 80-nm core systems, for which he has been investigat-
cmos for high-density optical interconnects,” IEEE J. Solid-State ing on-chip communication architectures, with emphasis on both elec-
Circuits, vol. 40, no. 12, pp. 2667–2679, Dec. 2005. tronic and optical on-chips networks, ranging from design tools to
[33] C. Batten, et al., “Building manycore processor-to-dram networks implementation and physical design.
with monolithic silicon photonics,” in Proc. 16th IEEE Symp. High
Performance Interconnects, 2008, pp. 21–30. Flich received the PhD in computer engi-
Jose
[34] F. Xia, L. Sekaric, and Y. Vlasov, “Ultracompact optical buffers on
neering, in 2001. He is a full professor with UPV,
a silicon chip,” Nature Photon., vol. 1, pp. 65–71, 2007.
where he leads the research activities related to
[35] W. Bogaerts, P. Dumon, D. V. Thourhout, and R. Baets, “Low-loss,
NoCs. He published more than 100 conference
low-cross-talk crossings for silicon-on-insulator nanophotonic
and journal papers, and has served in different
waveguides,” Opt. Lett., vol. 32, no. 19, pp. 2801–2803, 2007.
conference program committees (ISCA, PACT,
[36] D. Nikolova, et al., “Scaling silicon photonic switch fabrics for
HPCA, NOCS, ICPP, IPDPS, HiPC, CAC, CASS,
data center interconnection networks,” Opt. Express, vol. 23, no. 2,
ICPADS, ISCC), as program chair (INA-OCMC,
pp. 1159–1175, 2015.
CAC) and track co-chair (EUROPAR). He has
[37] N. Eisley and L.-S. Peh, “High-level power analysis for on-chip
collaborated with different Institutions (Ferrara,
networks,” in Proc. Int. Conf. Compilers Archit. Synthesis Embedded
Naples, Catania, Jonkoping, USC) and compa-
Syst., 2004, pp. 104–115.
nies (AMD, Intel, Sun). Current research activities focus on routing,
[38] F. Xia, M. Rooks, L. Sekaric, and Y. Vlasov, “Ultra-compact high
coherency protocols, and congestion management within NoCs. He has
order ring resonator filters using submicron silicon photonic wires
co-invented different routing strategies, reconfiguration and congestion
for on-chip optical interconnects,” Opt. Express, vol. 15, no. 19,
control mechanisms, some of them with high recognition (RECN and
pp. 11 934–11 941, 2007.
LBDR for on-chip networks). He is a member of the HiPEAC NoE. He is
[39] W. J. Dally and C. L. Seitz, “Deadlock-free message routing in
coeditor of the book Designing Network-on-Chip Architectures in the
multiprocessor interconnection networks,” IEEE Trans. Comput.,
Nanoscale Era. He coordinated the FP7 NaNoC project and the H2020
vol. 100, no. 5, pp. 547–553, May 1987.
MANGO project. He is a senior member of the IEEE.
[40] G.-M. Chiu, “The odd-even turn model for adaptive routing,”
IEEE Trans. Parallel Distrib. Syst., vol. 11, no. 7, pp. 729–738, Jul.
2000. Alessandro Cilardo received the five-year
[41] J. Chan, G. Hendry, K. Bergman, and L. P. Carloni, “Physical-layer degree in electronics engineering, magna cum
modeling and system-level design of chip-scale photonic intercon- laude and the PhD degree in computer science
nection networks,” IEEE Trans. Comput.-Aided Des. Integr. Circuits from the University of Naples Federico II, in Janu-
Syst., vol. 30, no. 10, pp. 1507–1520, Oct. 2011. ary 2003 and November 2006, respectively. He is
[42] S. Kamil, L. Oliker, A. Pinar, and J. Shalf, “Communication currently an assistant professor with the Univer-
requirements and interconnect optimization for high-end scien- sity of Naples Federico II. His research activities
tific applications,” IEEE Trans. Parallel Distrib. Syst., vol. 21, no. 2, focus on electronic design automation, high-per-
pp. 188–202, Feb. 2010. formance and embedded computing, as well as
[43] J. S. Vetter and F. Mueller, “Communication characteristics of computer arithmetic, with special emphasis on
large-scale scientific applications for contemporary cluster the application domain of security and cryptogra-
architectures,” J. Parallel Distrib. Comput., vol. 63, no. 9, pp. 853– phy-related processing. He is the single or main author of around 70
865, 2003. peer-reviewed papers published in leading scientific journals and confer-
[44] W. Liu, et al., “A NoC traffic suite based on real applications,” in ences. He is a senior member of the Institute of Electrical and Electron-
Proc. IEEE Comput. Soc. Annu. Symp. VLSI, 2011, pp. 66–71. ics Engineers and a member of the HiPEAC NoE.
Authorized licensed use limited to: Georgia Institute of Technology. Downloaded on September 03,2023 at 07:30:41 UTC from IEEE Xplore. Restrictions apply.