Professional Documents
Culture Documents
Abstract—Modern computer system rely more and more on – • minimize silicon infrastructure while supporting high
chip communication protocol to exchange data. System-on-chip performance and low power on-chip communication.
(SoC) designs use bus protocols for high performance data
transfer among the Intellectual Propert (IP) cores. These
The Advanced Microcontroller Bus Architecture
protocols incorporate advanced features such as pipelining,burst
and split transfers. In this paper, we describe a case study for (AMBA) was introduced by ARM in 1996 and is widely
different AMBA SOC bus protocol and their performance used as the on-chip bus in SoC designs. AMBA is a
comparison: the Advanced Micro-controller Bus Architecture registered trademark of ARM. The first AMBA buses were
(AMBA) protocol from ARM. It starts ith a brief introduction Advanced System Bus (ASB) and Advanced Peripheral Bus
AMBA 2.0 protocol, AMBA 3.0 protocol and AMBA 4.0 (APB). In its 2nd version, AMBA 2, ARM[2] added AMBA
protocol, and concludes with a discussion related to a High-performance Bus (AHB) that is a single clock-edge
comparison performance analysis of all three AMBA bus protocol. In 2003, ARM introduced the 3rd generation,
protocol. AMBA3[3], including AXI to reach even higher
performance interconnect and the Advanced Trace Bus
Keywords- SOC, On-chip Communication ,AMBA, Bus
Protocol (ATB) as part of the Core Sight on-chip debug and trace
solution. In 2010, ARM introduced the 4th generation,
AMBA 4,[1] including AMBA 4 AXI4, AXI4-Lite, and
I. INTRODUCTION
AXI4-Stream Protocol,The AMBA 4.0 protocol defines
On-chip communication architectures can have a great five buses/interfaces:
influence on the speed and area of System-on-Chip • Advanced eXtensible Interface (AXI)
Designs. Modern computer system[14] rely more and more • Advanced High-performance Bus (AHB)
on higly complex on–chip communication protocol to
• Advanced System Bus (ASB)
exchange data. The enormous complexity of these protocol
results from tackling high-performance • Advanced Peripheral Bus (APB)
requirements.protocol control can be distributed,and there • Advanced Trace Bus (ATB)
may be non-atomicity or speculation. The electronics The third generation of AMBA defined in the AMBA 3
industry has entered the era of multi-million-gate chips, and protocol specification[7], is targeted at high performance,
thereXs no turning back. This technology promises new high clock frequency system designs and includes features
levels of integration on a single chip, called the System-on-a- which make it very suitable for high speed sub-micrometer
Chip (SoC) design, but also presents significant challenges to interconnect:
the chip designer. Processing cores on a single chip, may • separate address/control and data phases
number well into the high tens within the next decade, given • support for unaligned data transfers using byte
the current rate of advancements [1]. The important aspect of strobes
a SoC is not only which components or blocks it houses, but • burst based transactions with only start address
also how they are interconnected. AMBA is a solution for
issued
the blocks to interface with each other. The oblective of the
AMBA specification [15] is to: • issuing of multiple outstanding addresses
• facilitate right-first-time development of embedded A simple transaction on the AHB consists of an address
microcontroller products with one or more CPUs, GPUs phase and a subsequent data phase (without wait states: only
or signal processors, two bus-cycles). Access to the target device is controlled
• be technology independent, to allow reuse of IP cores, through a Mux(non-tristate), thereby admitting bus-access to
peripheral and system macrocells across diverse IC one bus-master at a time. AMBA 4.0 protocol[12] includes
processes, encourage modular system design to definition of an expanded family interconnect protocols
improve processor independence, and the development including AXI4, AXI4-Lite and AXI4-Stream. The AXI4
of reusable peripheral and system IP libraries protocol adds support for longer bursts and Quality of
450
• Separate read and write data channels to enable completion of the write transaction in fig3. The AXI4
low-cost direct memory access (DMA) protocol[12] enables:
• Ability to issue multiple outstanding addresses • Address information to be issued ahead of the actual data
• Out-of-order transaction completion transfer
• Easy addition of register stages to provide timing • Support for multiple outstanding transactions
closure • Support for out-of-order completion of transactions.
• Protocol includes optional extensions that cover
signaling for low-power operation
Figure 3. AXI 4
451
C. Transaction ordering V. PERFORMANCE COMPARISON OF AMBA PROTOCOL
The AXI protocol enables out-of-order transaction 5.1 Features Comparison:
completion. It gives an ID tag to every transaction across the
interface. The protocol requires that transactions with the AMBA 2 AHB Protocol AMBA 3 AXI Protocol
same ID tag are completed in order, but transactions with Fixed pipeline for address Five independent channels for addr/data and
different ID tags can be completed out of order. Out-of- and data transfers response
order transactions can improve system performance in two Bidirectional link with
Each channel is unidirectional except for
ways: complex timing
single handshake for return path
relationships
• The interconnect can enable transactions with fast
responding slaves to complete in advance of earlier Hard to isolate timing Register slices isolate timing
transactions with slower slaves. Limits frequency of
Frequency scales with pipelining
• Complex slaves can return read data out of order. For operation
example, a data item for a later access might be available Inefficient asynchronous
High performance asynch bridging
from an internal buffer before the data for an earlier access bridges
is Separate address for every
Burst based--one address per burst
Available. data item
Only one transaction at a
4.2 AXI4- Lite Protocol time
Multiple outstanding transactions
452
5.2 Protocol Efficiency
Figure 6: AXI 3
453
requires that the write response for all transactions must not
be given until the clock cycle after the acceptance of the last
data transfer. In addition, the AXI4 protocol requires that
the write response for all transactions must not be given
until the clock cycle after address acceptance
8. Ttransactionwrite interleaving support in AXI4.
9.If the AXI is a 64 bit bus running at 200 MHz, then the
AHB will be a 128 bit bus running at 400 MHz. The burst
sizes will be: small (16 Bytes), medium (32 Bytes), and
large (64 Bytes).
10.AXI3 protocol:• Early termination of bursts it not
supported.A burst must not cross a 4-kbyte boundary.AXI4
protocol longer burst support
REFERENCES
.
Figure 9. AMBA AXI [1] ARM, “AMBA Specification Overview”, available at
http://www.arm.com/.
[2] ARM, “AMBA Specification (Rev 2.0)”, available at
http://www.arm.com.
VI. USING THE TEMPLATE [3] ARM, “AMBA AXI Protocol Specification”, available at
http://www.arm.com.
1. AXI4 address width -32 bits with 16 master and 16 slave [4]. Benini and D. Bertozzi, “Network-on-chip architectures and design
device methods,” IEE Proc.-Comput. Digit. Tech., Vol. 152, No. 2, March 2005
2.Write interleaving. This feature was retracted by AXI4 [5]Chien-Hung Chen, Jiun-Cheng Ju, and Ing-Jer Huang, “A
Synthesizable AXI Protocol Checker for SoC Integration,”IEEE,,ISOCC
protocol. AXI3 master devices must be configured as if 2010.
connected to a slave with Write interleaving depth of one. [6]Ing. Dries Driessens and. Tom Tierens, “Overview Embedded Buses,”
3.AXI4 QoS signals do not influence arbitration priority. 2003 by Patrick Pelgrims.
QoS signals are propagated from masters to slaves. Interface [7]Mick Posner, Darrin Mossor ,”Designing Using the AMBA (TM) 3
AXI (TM) Protocol -- Easing the Design Challenges and Putting the
data widths: AXI4: 32, 64, 128, 256, 512, or 1024 bits an Verification Task on a Fast Track to Success” from Synopsys.
AXI4-Lite: 32 bits [8]Denali Memory Blog Steve Leibson ,”New White Paper discusses the
4. AXI4 address width -32 bits with 16 master and 16 slave challenges of chip design based on AMBA 4”From 2010, Cadence.
device. [9]Marcus Harnisch ,”Migrating from AHB to AXI based SoC
Designs”,from Doulos, 2010
5.The AMBA 4 protocol new features such as 256-beat [10] Krishna Sekar, Kanishka Lahiri,, Anand Raghunathan ,”Dynamically
burst support for improved performance, Quality of Service Configurable Bus Topologies for High-Performance On-Chip
(QoS) signaling for multi-master support, the AXI4-Stream Communication”, IEEE TRANSACTIONS ON VERY LARGE SCALE
protocol for non-address-based, point-to-point INTEGRATION (VLSI) SYSTEMS, VOL. 16, NO. 10, OCTOBER 2008.
[11] Sangkwon Na*, Sung Yang**, and Chong-Min Kyung,”Low-Power
communication among masters and slaves (especially useful Bus Architecture Composition for AMBA AXI ,JOURNAL OF
for data streams commonly used for moving video and SEMICONDUCTOR TECHNOLOGY AND SCIENCE, VOL.9,2009.
audio on an SoC), multiple region interfaces. [12]AMBA® 4 AXI4™, AXI4-Lite™, andAXI4-Stream™ Protocol
6. The AXI4 protocol supports an ordering model based on Assertions user Guide 2010 ARM.
[13]NY Liao,”Analysis of shared-link AXI” IET Comput. Digit. Tech.,
the use of the AXI ID transaction 7.The AXI3 protocol Vol. 3, Iss. 4, pp. 373–383, 2009.
454