Professional Documents
Culture Documents
SERVER ARCHITECTURE
Highly-Scalable x64 Computing Based on Powerful
Six-Core AMD Opteron™ Processors
White Paper
November 2009
Sun Microsystems, Inc.
Table of Contents
Executive Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
Server Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
System-Level Architecture . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
CPU/Memory Modules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
Sun Fire X4640 Server Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
For More Information . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
This Page Intentionally Left Blank
1 Executive Summary Sun Microsystems, Inc.
Executive Summary
Even as the x64 computing architecture has grown in ubiquity, organizations continue
to demand new levels of scalability from their computing platforms. Databases need to
process ever-larger number of transactions. Growing data warehouses require larger
analysis efforts to extract useful information. High performance computing (HPC)
applications are increasingly pressed into service to answer our most critical business
needs and scientific inquiries. While the latest x64 processors offer very impressive
levels of performance, organizations need systems that offer balanced scalability and
performance, to preserve and capitalize on investments.
At the same time, many organizations are responding to new levels of server sprawl
and complexity that have resulted from deploying large numbers of legacy systems.
Administrative, energy, and real estate costs continue to grow unabated — severely
straining IT budgets. Increasingly, many are responding with consolidation strategies
that combine ever more powerful standard x64 systems with innovative new
approaches to virtualization. Not only does consolidation offer to improve vital resource
utilization, but it can also greatly enhance business agility by providing computational
resources where they are needed most.
The Sun Fire™ X4640 server employs the latest high-performance AMD Opteron™
processors in a massively-scalable system that demonstrates Sun's proven datacenter-
centric design focus. Ideal for databases, HPC applications, and mission-critical
applications alike, this server delivers scalable and manageable x64 computing in
dense and servicable rackmount enclosures. Using the latest Six-Core AMD Opteron
processors, the Sun Fire X4640 server is particularly well suited for consolidation and
virtualization initiatives. With compelling compute power, memory capacity, and I/O
density, as well as considerable energy efficiency, organizations can live within their
means even as they grow their IT infrastructure.
With a choice of operating systems — including the Solaris™ Operating System (OS),
Linux, Microsoft Windows, and VMware — the Sun Fire X4640 server effectively
combines Six-Core AMD Opteron processors with balanced system design. Large
memory support, considerable I/O bandwidth, and up to eight processor sockets (up to
48 processor cores) give this server the capacity to handle the most demanding
applications. With datacenter operation as a fundamental design assumption, the
server also offers redundant and hot-swappable components, efficient front-to-back air
flow, highly-efficient power supplies, and built-in system management tools. Beyond
supporting a large number of processors, the Sun Fire X4640 server is engineered to
provide mission-critical application availability, even as it conserves valuable energy
resources and lowers operational expenses. This document details the systems
architecture of Sun Fire X4640 server, along with key software components.
2 Delivering Massive x64 Single-System Scalability Sun Microsystems, Inc.
Chapter 1
Delivering Massive x64 Single-System Scalability
are also hot-swappable. Four integrated Gigabit Ethernet ports enhance network
availability and can be utilized in failover configurations. On-board system
management tools encourage proactive remote monitoring and intervention.
• Tightly-Integrated Management
To support out-of-band management, the Sun Fire X4640 server incorporates an
Integrated Lights Out Management (ILOM) service processor. This built-in
hardware-based management functionality allows administrators to monitor and
manage systems remotely, letting them take corrective action as necessary to
minimize unplanned downtime.
Sun Fire X4640 servers combine best-in-class performance with noteworthy compute,
memory, and I/O capacities. As a result, these systems are designed to scale up, scale
out, and scale within, enabling deployment in a wide range of application
architectures:
• Scale-up architectures — With up to 48 processor cores available per system, this
server is well suited to scale for growing workloads that deliver Web, database, and
other key infrastructure services.
• Scale-out architectures — With large memory capacities, significant internal storage,
four Gigabit Ethernet ports, and high-bandwidth PCI Express expansion, these servers
can scale to solve complex computing problems that demand intensive computing
power and data bandwidth.
• Scale-within capabilities — With the ability to support a wide range of virtualization
technologies, Sun Fire X4640 servers are ideal systems to consolidate multiple
applications within a single extensible platform.
Figure 1. The Sun Fire X4640 server offers up to 48 AMD Opteron cores in only four
rack units (4U).
5 Delivering Massive x64 Single-System Scalability Sun Microsystems, Inc.
System Comparisons
The Sun Fire X4640 server complements Sun’s extended x64 server product line. Sun
provides a wide range of AMD Opteron based servers with diverse capabilities. Table 1
compares the features of Sun Fire X4140, X4440, and X4640 servers.
Table 1. Sun Fire X4140, X4440, and X4640 server feature comparison
Feature Sun Fire X4140 Server Sun Fire X4440 Server Sun Fire X4640 Server
Processors One or two Six-Core AMD Two or four Six-Core AMD Two, four, six, or eight Six-Core
Opteron series 2000 processors Opteron Series 8000 processors AMD Opteron Series 8000
processors
Memory capacity Up to 128 GB Up to 256 GB Up to 512 GB
(2, 4, or 8 GB DDR2 DIMMs) (2, 4, or 8 GB DDR2 DIMMs) (2, 4, or 8 GB DDR2 DIMMs)
Maximum internal Up to eight SFF 2.5-inch SAS 73 Up to eight SFF 2.5-inch SAS 73 Up to four SFF 2.5-inch SAS 73 or
disk drives or 146 GB disk drives, choice of or 146 GB disk drives, choice of 146 GB disk drives, internal RAID
RAID HBAsa RAID HBAsa
RAID Hardware RAID 0, 1, 1E, 10, 5, Hardware 0, 1, 1E, 10, 5, 5EE, Hardware 0 or 1
5EE, 50, 6, and 60 with SAS 50, 6, and 60 with SAS RAID
RAID HBA HBA
Removable and Slimline DVD+/-RW Slimline DVD-R Slimline DVD/CD-RW
pluggable I/O Five USB 2.0 ports (Two front, Five USB 2.0 ports (Two front, Five USB 2.0 ports (Two front,
two rear, one internal) two rear, one internal) two rear, one internal)
Expansion slots Two x8 PCI Express slots, One x4 PCI Express slot, Two PCI-X slots,
One x16 PCI Express slot Four x8 PCI Express slots, and Four x8 PCI Express slots, and
One x16 PCI Express slot Two x4 PCI Express slots
Ethernet Four on-board Gigabit Ethernet Four on-board Gigabit Ethernet Four on-board Gigabit Ethernet
ports (10/100/1000Base-T) ports (10/100/1000Base-T) ports (10/100/1000Base-T)
Power supplies Dual redundant hot-swappable Dual redundant hot-swappable Four redundant hot-swappable
AC 650 W power supply units AC 1050 W power supply units AC 1133 W power supply units
(N+N redundancy) (N+N redundancy) (N+N redundancy)
Fans Redundant, hot-swappable fan Redundant, hot-swappable fan Redundant, hot-swappable fan
modules (N+N redundancy) modules (N+N redundancy) modules (N+N redundancy)
Form factor 1 rack unit (1U) 2 rack units (2U) 4 rack units (4U)
Enterprise-Class Capabilities
Ultimately, datacenter requirements go beyond mere performance and scalability.
Effective servers must provide considerable flexibility to serve a wide variety of needs in
an agile fashion. The Sun Fire X4640 server is designed for datacenter operation —
running multiple operating systems and providing significant reliability, availability,
and availability (RAS) features. Ultimately, Sun’s innovative chassis design across its x64
product line contributes directly to the datacenter utility of these powerful systems.
Taking advantage of the Solaris 10 OS can bring added flexibility and power to the
enterprise. Supporting systems from laptops and single-board computers to
datacenter and cluster installations, the Solaris 10 OS serves applications ranging
from military command and control systems, to telecommunication switch gear,
to stock trading. The Solaris 10 OS also includes more than 180 applications from
the free and open source software (F/OSS) community, and thousands of others
are freely available for download over the Internet.
• Linux Environments
Sun offers and supports the leading Linux variants on Sun Fire x64 servers,
including Red Hat Enterprise Linux and Novell SUSE Linux Enterprise Server. As the
leader in enterprise services for UNIX®, Sun brings decades of expertise to Linux
environments. Sun support contracts for Linux provide all front-line support and
transparent access to back-line support from Red Hat and Novell.
The following key design elements help increase the dependability of IT services:
Chapter 2
AMD Opteron™ Processor Technology
The Sun Fire X4640 server is powered by the latest AMD Opteron processors, utilizing
AMD’s Direct Connect Architecture and NVIDIA chipsets for scalability and fast I/O
throughput. The server supports Forth-Generation Six-Core AMD Opteron processors.
The sections that follow describe the architecture and feature set of supported AMD
Opteron processors.
AMD Opteron processors are offered in three series as shown below. Sun Fire X4640
servers employ 8000 Series AMD Opteron processors.
• 1000 Series — Single socket
• 2000 Series — Up to two sockets (one coherent HyperTransport 3 link)
• 8000 Series — Four to Eight sockets (three coherent HyperTransport 3 links)
512 KB
128 128 KB 128 KB 128 KB 128 KB 128 KB
L2 Cache
L1 L1 Cache L1 Cache L1 Cache L1 Cache L1 Cache
512 KB 512 KB 512 KB 512 KB 512 KB 512 KB
L2 Cache L2 Cache L2 Cache L2 Cache L2 Cache L2 Cache
6 MB L3 Cache
Crossbar Switch
Figure 2. Support for six cores increases computational throughput in the same
thermal envelope as previous-generation AMD Opteron processors.
The Six-Core AMD Opteron processor offers a number of key advantages, including:
• Six cores, each with 128 KB of Level-1 cache and 512 KB of Level-2 cache
• 6 MB Level-3 cache shared by all processor cores
• HyperTransport 3 support for up to 4.8 GT/sec. (19.2 GB/sec.) per link
• HyperTransport Assist (HT Assist) to help improve overall system performance in
multisocket systems
• Dual Dynamic Power Management support
• AMD-V and AMD-P support
13 AMD Opteron™ Processor Technology Sun Microsystems, Inc.
Fourth-Generation Six-Core AMD Opteron processors are labeled as 24xx and 84xx CPUs
distinguishing them from earlier Third-Generation Quad-Core AMD Opteron processors
(23xx and 83xx CPUs).
PCIe
Bridge
Coherent HyperTransport 1
HyperTransport 3 2.0 GT/sec.
4.8 GT/sec.
PCIe
Bridge
Note – Six-Core AMD Opteron processors can only be installed on systems with a motherboard
that supports HyperTransport 3, excluding earlier models of the Sun Fire X4600 M2 server. Ear-
lier Sun Fire X4600 M2 servers cannot be upgraded to support coherent HyperTransport 3. Sys-
tem CPU/Memory modules are specially keyed so that they may not be inserted in to
incompatible systems.
Figure 4. HT Assist sets up indices in the cache on each processor, greatly reducing
probe filter traffic.
HT Assist saves time, reduces probe filter traffic, and can increase performance, by:
• Reducing latency from inter-processor communication
• Increasing STREAM memory bandwidth by as much as 60% (application dependent)
• Providing significant improvements in HPC, database, and virtualization workloads
• Increasing performance for cache-sensitive applications on systems four-socket
systems
• Dual Dynamic Power Management allows each processor to maximize the power-
saving benefits of Enhanced AMD PowerNow! technology without compromising
performance. Dual Dynamic Power Management can reduce idle power consumption
and allow for per-processor power management in multi-socket systems to decrease
power consumption.
• AMD CoolCore Technology evaluates which parts of the die — the cores, the memory,
or both — are needed to support currently running applications. AMD CoolCore
Technology can cut power to unused transistor areas to reduce power consumption
and lower heat generation.
• AMD Smart Fetch Technology lets inactive cores write contents of their L1 and L2
caches to the shared L3 cache. This feature can allow the inactive cores to enter a
“halt” state and draw less power, reducing CPU power consumption while still
making the contents of their caches available to other active cores.
Chapter 3
Server Architecture
The Sun Fire X4640 server is designed to provide multisocket scalability and high
performance while offering high reliability and low power consumption. The sections
that follow detail physical and architectural aspects of the system.
System-Level Architecture
Figure 5 provides a high-level block diagram of the Sun Fire X4640 server motherboard,
illustrating a relatively simple and elegant design. The server supports up to eight Six-
Core AMD Opteron processors on CPU/Memory modules that insert directly into the
motherboard. Eight on-board DIMM slots are provided on each CPU/Memory module,
and are directly connected to the processor. CPU/Memory Modules are added in pairs,
and are interconnected by dedicated Coherent HyperTransport links (4.8 GT/sec.)
System configurations with two, four, six, and eight processors are supported. All
processors in a system must be the same type (six-core, same stepping, and the same
frequency).
CPU Module G
CPU Module B
CPU Module C
CPU Module D
CPU Module H
CPU Module A
CPU Module E
CPU Module F
Shared
100 MHz Bus
32 MB
Flash
+
LSI
+ DVI Video
over LAN
SAS 1064 USB Video Redirect MPC8248
Hub Rage XL SP
+ +
64 MB
DDR
SDRAM
BCM
5221 Managed
ILOM Power Supplies
PCI-X 100 MHz
PCI-X 100 MHz
PCI-E 8-Lane
PCI-E 8-Lane
PCI-E 4-Lane
PCI-E 8-Lane
PCI-E 8-Lane
PCI-E 4-Lane
The design of a HyperTransport topology for larger numbers of processors must take
hop distance into account. In particular, average hop distance increases when building
eight-socket systems when compared to that of four-socket systems. Minimizing the
number of hops is ideal, and Sun Fire X4640 servers accomplish minimum hop distance
by enhancing the simple ladder topology as shown in Figure 6.
Figure 6. Enhanced Twisted Ladder topologies yield reduced hop counts for larger
numbers of processors
Sun’s approach using Coherent HyperTransport technology minimizes hop count and
latency between processors. Since memory is controlled by each processor, latency for
remote memory accesses is also reduced. Figure 7 illustrates the hop counts for two-
socket, four-socket, six-socket, and eight-socket Sun Fire X4640 server configurations.
18 Server Architecture Sun Microsystems, Inc.
Though many choose to divide the resources of Sun Fire X4640 servers using
virtualization technology, workloads with higher CPU components and lower memory
latency/bandwidth needs scale best beyond four sockets.
2-Socket 4-Socket 6-Socket 8-Socket
CPU 0 1 CPU 0 1 2 3 CPU 0 1 2 3 4 5 CPU 0 1 2 3 4 5 6 7
0 0 1 0 0 1 1 2 0 0 1 1 2 2 3 0 0 1 1 2 2 2 2 3
1 1 0 1 1 0 1 1 1 1 0 2 1 1 2 1 1 0 2 1 2 2 1 2
2 1 1 0 1 2 1 2 0 1 1 2 2 1 2 0 2 1 1 2 2
A, B
3 2 1 1 0 3 2 1 1 0 2 1 3 2 1 2 0 1 1 2 2
4 2 1 1 2 0 1 4 2 2 1 1 0 2 1 2
A, D, E, H
5 3 2 2 1 1 0 5 2 2 1 1 2 0 2 1
6 2 1 2 2 1 2 0 2
A, B, D, E, F, H
7 3 2 2 2 2 1 1 0
A, B, C, D, E, F, G, H
CPU CPU
Module A Module H + +
PCI-X
+
8132 + IO-04
+ +
PCI-X
+
8132 + IO-04
CK8-04P
CK8-04P
+ + + +
PCI-X PCI-X
+
8132 + IO-04 +
8132 +
CK8-04P CK8-04P
I/O Subsystem
The Sun Fire X4640 I/O subsystem is connected to the CPU/Memory complex through a
HyperTransport link from processors A and H. The I/O bridges are implemented with the
NVIDIA NForce4 Professional 2200 (CK8-04) and the NVIDIA nForce4 Professional 2050
(IO-04) media and communications processors (MCPs). Other I/O components connect
to these three main chipsets, including:
• A quad-channel SAS RAID controller (LSI SAS 1064) attached to one of the AMD-8132
PCI-X tunnels on a shared bus with the PCI-X slot 0.
• Two 10/100/1000 dual Gigabit Ethernet MAC/PHY devices (Intel FW82546GB NIC)
• An ATI Rage XL video controller device attached to the PCI bus of the NVIDIA nForce
Professional 2200
• A Super I/O device connecting the Integrated Lights Out Management (ILOM) service
processor to the NVIDIA nForce Professional 2200 chip.
AMD-8132 Device
Side A Side B
HyperTransport Link HyperTransport Link Downstream
Host Tunnel
16 bits upstream 16 bits upstream Device
16 bits downstream 16 bits downstream
PCI-X PCI-X
Bridge A Bridge B
The NVIDIA nForce Professional 2050 (IO-04) supports three PCI Express ports (two x8
and one x4).
The ILOM daughter card is a standalone system with no dependencies on the operating
system. Physical interfaces are provided through the connections listed above. The
actual connected devices, however, are located on the main system board. More
information on ILOM capabilities is provided in Chapter 4. ILOM daughter card
connections include:
• DVI output port connected to the ATI Rage XL via FPGA to support remote graphics
redirection from the ILOM service with a maximum VGA resolution of 1024 x 768
• Three USB connections to the NVIDIA nForce Professional 2200 to provide remote
keyboard, mouse, and storage functionality
• LPC connection to the NVIDIA nForce Professional 2200 via an FPGA
• 10/100 MB Ethernet connected to the on-board Broadcom BCM5221 10/100 Ethernet
controller, which automatically detects straight or cross-over Ethernet cables
• External serial port multiplexed with the main system serial port for access to the
ILOM CLI and system console using serial port redirection (S-o-L, Serial-over-LAN)
CPU/Memory Modules
The Sun Fire X4640 server supports up to eight Six-Core AMD Opteron Series 8000
processors — each provided on a CPU/Memory module with associated slots for
memory DIMMs. CPU/Memory modules are also known as "books", and each contains:
• AMD Opteron CPU and heatsink
• 8 DDR2 memory DIMM slots
• CPU/Memory Voltage Regulator Module
• Airflow redirection baffles for greatest cooling efficiency
• A Fault Remind button and per-DIMM LEDs to help identify a failed component.
The CPU/Memory module (Figure 10) is connected to the system motherboard using a
pair of ganged connectors. Each processor receives core voltage input from an
individual voltage regulator module (VRM). The VRM regulates the system current and
voltage according to the VID code output by the CPU package. The VRM is a DC-to-DC
point-of-load convertor specified for +12 Volt input and programmable 100 Amp output.
Sun Fire X4640 CPU/Memory modules also support split-plane operation for support of
AMD Dual Dynamic Power Management.
Sun Fire X4640 CPU/Memory modules are not compatible with previous-generation Sun
Fire X4600 M2 servers. Likewise, the previous-generation CPU/Memory modules made
for Sun Fire X4600 M2 servers are not compatible with the Sun Fire X4640 server. CPU/
Memory modules are physically keyed to prevent accidental deployment in non-
compatible systems.
23 Server Architecture Sun Microsystems, Inc.
Figure 10. The Sun Fire X4640 server CPU/Memory Module (book) provides support
for a Six-Core AMD Opteron processor and up to eight DDR2 DIMMs.
Memory Subsystem
Each AMD Opteron processor includes a low-latency, high-bandwidth, integrated
memory controller that reduces latencies during memory access over traditional front-
side bus-based memory controllers, and each supports up to eight ECC Registered
DDR2/667 memory modules. Although the AMD Opteron processor's memory controller
works in 64-bit or 128-bit mode ECC operation, for best performance results, it is
recommended to run 128-bit ECC operation mode. To run in 128-bit mode, DIMMs are
populated in pairs such that they occupy one-half of the AMD Opteron processor's 128-
bit controller interface. The controller supports 1 bit per byte ECC, and the Sun Fire
X4640 server uses DDR2/667 (PC2-5300) registered DDR SDRAM modules.
Eight DDR DIMM slots are provided per processor, and they are color-coded white and
black to indicate population order (white = 0,1 and black = 2,3; white pair first). LED
fault indicators controlled by the ILOM Service Processor provide the ability to easily
identify failed DIMM modules. Memory capacity scales with the number of processors.
As a result, a dual-socket system can support a maximum of sixteen DIMMs (up to
128 GB with 8 GB DIMMs). A fully populated, eight-processor server supports a
maximum of 64 DIMMS or up to 512 GB of memory with 8 GB DIMMs.
24 Server Architecture Sun Microsystems, Inc.
Other features of the Sun Fire X4640 server memory architecture include:
• Dedicated on-die 128-bit wide DDR memory controller
• Memory bandwidth up to 10.7 GB/sec. @ DDR2/667
• Under 80 ns direct memory latency
• Registered ECC DDR2/667 (PC2-5300) DIMMs supported
• Up to 32 GB per CPU with 4 GB DIMMs in 8 DIMM slots per CPU
Four system fans insert from the top of the chassis, and four power supplies insert from
the rear of the chassis. The four power supplies provide N+N redundancy. The system
can continue to operate at full capacity with any combination of two of the four power
supplies.1
Enclosure
The 4U Sun Fire X4640 server enclosure is designed for use in a standard 19-inch rack
(Table 4).
1.Some restrictions apply when apply when all 64 DIMM slots are populated with 137 Watt CPUs.
26 Server Architecture Sun Microsystems, Inc.
The Sun Fire X4640 server includes the following major components:
• Support for two, four, six, or eight CPU/Memory modules, with each module
containing one Six-Core AMD Opteron 8000 series processor
• Eight DIMM slots per CPU/Memory module, supporting Up to 512 GB of memory with
Registered ECC DDR2/667 (PC2-5300) DIMMs
• Four drive bays for SAS disk drives and additional DVD-ROM
• Four on-board 10/100/1000Base-T Ethernet ports
• Five USB 2.0 ports (two forward, two rear facing, one internal)
• Six PCI Express expansion slots (four x8, and two x4)
• Two 64-bit PCI-X slots @ 100 MHz
• ILOM service processor daughter card with 10/100Base-T Ethernet port
• 19-inch rackmount kit
• Cable management arm (optional)
• Four 1133 Watt AC power supplies (hot-swappable in N+N redundant configuration)
with integral fans
• Four (N+1 redundant) hot-swappable cooling fans under environmental monitoring
and control, accessed directly from the top of the chassis
27 Server Architecture Sun Microsystems, Inc.
Cooling
Because heat is so destructive to electronic components, lowering operating
temperature can substantially increase reliability. Sun's cooling design enables the
CPUs to operate approximately 15 percent more efficiently than in some competitive
systems.
PSU Airflow
Chassis Divider
Motherboard
Airflow
Motherboard
Airflow
Figure 13. Sun Fire X4640 servers provide front-to-back airflow with airflow
segregated between power supplies and the main chassis
As shown, the Sun Fire X4640 server is partitioned by a chassis divider that segregates
airflow into two distinct chambers:
• The motherboard/PCI and fan chamber, and
• The Power Supply Unit (PSU) and storage chamber
Both areas are air cooled front-to-back in concert with typical modern datacenter
practice. The PSU and storage chamber is cooled by individual fans on the back of each
power supply. Air is drawn through the front of the server by these rear-located fans.
The motherboard/PCI and fan chamber is cooled by redundant rows of fans mounted in
29 Server Architecture Sun Microsystems, Inc.
front of the server behind the bezel (Figure 14). Fans are hot swappable, and each fan
has an LED fault indicator. Fans are accessible from the top of the server so that they
can be accessed without interrupting system operation.
Figure 14. Four N+1 redundant fans are accessed through the top of the Sun Fire
X4640 server.
Power
The Sun Fire X4640 server is powered by dual redundant hot-swappable power supply
units (PSUs) with integral fans. The server requires two PSUs, but up to four PSUs can
be used per system for redundancy. For optimal redundancy, separate circuits can be
used to eliminate the dependence on a single electrical circuit, and to provide for
consistent N+N redundancy with the PSUs themselves. Power supplies are monitored
and managed by the ILOM service processor. PSU fans are not individually managed by
ILOM, but are instead considered part of the PSU unit. As a result, the failure of a PSU
fan is reported by ILOM as a failure of the entire PSU.
30 Enterprise-Class Operating System and Management Software Sun Microsystems, Inc.
Chapter 4
Enterprise-Class Operating System and
Management Software
Unlike many x86/x64 systems, Sun Fire X4640 server is ideally suited for enterprise
environments. Not only are the systems designed with the datacenter in mind, but the
operating system and management software provided with these systems allow them
to serve the most important and mission-critical applications. The sections that follow
describe the Solaris 10 OS and key enterprise-grade management technology.
Solaris 10 OS Support
Among the available operating systems, the Solaris OS is ideal for large-scale enterprise
deployments. Supported on all of Sun’s x86/x64 and SPARC platforms, the Solaris OS
has specific features that can enhance flexibility and performance — as described in
the sections that follow.
• Solaris Containers for Consolidation, Secure Partitioning, and Virtualization
Solaris Containers comprise a group of technologies that work together to
efficiently manage system resources, virtualize the system, and provide a
complete, isolated, and secure runtime environment for applications. Solaris
Containers can be used to partition and allocate the considerable computational
resources of Sun Fire X4640 servers. Solaris containers are comprised of Solaris
Zones and Solaris Resource Management which work together with the Solaris
fair-share scheduler.
– Solaris Zones — Solaris Zones can be used to create an isolated and secure envi-
ronment for running applications. A zone is a virtualized operating system envi-
ronment created within a single instance of the Solaris OS. Zones can be used to
isolate applications and processes from the rest of the system. This isolation
helps enhance security and reliability since processes in one zone are prevented
from interfering with processes running in another zone.
– Resource Management — Resource management tools provided with the
Solaris OS lets administrators dedicate resources such as CPU cycles to specific
applications. CPUs in multicore multiprocessor systems — such as Sun Fire
X4640 servers — can be logically partitioned into processor sets and bound to a
resource pool, and can ultimately be assigned to a Solaris zone. Resource pools
provide the capability to separate workloads so that consumption of CPU
resources does not overlap. Resource pools also provide a persistent configura-
tion mechanism for processor sets and scheduling class assignment. In addition,
the dynamic features of resource pools let administrators adjust system
resources in response to changing workload demands.
31 Enterprise-Class Operating System and Management Software Sun Microsystems, Inc.
• Solaris Dynamic Tracing (DTrace) to Instrument and Tune Live Software Environments
When production systems exhibit nonfatal errors or sub-par performance, the
sheer complexity of modern distributed software environments can make accurate
root-cause diagnosis extremely difficult. Unfortunately, most traditional
approaches to solving this problem have proved time-consuming and inadequate,
leaving many applications languishing far from their potential performance levels.
The Solaris DTrace facility provides dynamic instrumentation and tracing for both
application and kernel activities — even allowing tracing of application
components running in a Java Virtual Machine (JVM™)1. DTrace lets developers and
administrators explore the entire system to understand how it works, track down
performance problems across many layers of software, or locate the cause of
aberrant behavior. Tracing is accomplished by dynamically modifying the
operating system kernel to record additional data at locations of interest. Best of
all, although DTrace is always available and ready to use, it has no impact on
system performance when not in use, making it particularly effective for
monitoring and analyzing production systems.
1.The terms "Java Virtual Machine" and "JVM" mean a Virtual Machine for the Java platform.
32 Enterprise-Class Operating System and Management Software Sun Microsystems, Inc.
• Solaris ZFS™
Solaris ZFS offers a dramatic advance in data management, automating and
consolidating complicated storage administration concepts and providing
unlimited scalability with the world’s first 128-bit file system. ZFS is based on a
transactional object model that removes most of the traditional constraints on I/O
issue order, resulting in dramatic performance gains. ZFS also provides data
integrity, protecting all data with 64-bit checksums that detect and correct silent
data corruption.
• A Secure and Robust Enterprise-Class Environment
Best of all, the Solaris OS doesn’t require arbitrary sacrifices. The Solaris Binary
Compatibility Guarantee helps ensure that existing Solaris applications continue
to run unchanged, protecting investments. Certified multilevel security protects
Solaris environments from intrusion. Moreover, Sun’s comprehensive Fault
Management Architecture means that elements such as Solaris Predictive Self
Healing can communicate directly with the hardware to help reduce both planned
and unplanned downtime.
• Hard disk drive backplane sensors to determine the presence and health of the disk
backplane
• Fan sensors to determine the presence of the fan tray, the speed of individual fans,
and to detect the failure of individual fans.
In-band communication to the ILOM service processor is provided via the host OS
running on the server. A variety of management tools can be used to access
management information on individual servers:
• Intelligent Platform Management Interface (IPMI) with IPMItool
IPMItool is a simple command-line interface to systems that support the
Intelligent Platform Management Interface (IPMI) v2.0 specification. IPMItool
provides the ability to read the sensor data repository and print sensor values,
display the contents of the system event log, print field-replaceable unit
information, read and set LAN configuration parameters, and perform remote
chassis power control. IPMItool was originally written to take advantage of IPMI-
over-LAN interfaces but it is also capable of using the system interface as provided
by a Linux kernel device driver such as OpenIPMI or a Solaris OS driver called BMC
that is provided with the Solaris 10 OS. IPMItool is available under a BSD
compatible license.
IPMItool is not designed to replace the OpenIPMI library but instead provides a
completely command-line oriented tool that can be used by administrators in
conjunction with other tools. Where possible, IPMItool supports comma-
separated values for output to facilitate parsing by other scripts or programs.
IPMItool is designed to run quick command-response functions that can be as
simple as turning the system on or off, or as complex as reading in the sensor data
records while extracting and printing detailed sensor information for each record.
servers provide SNMP MIBs to manage and monitor the servers using any SNMP-
capable network management system, such as HP OpenView Network Node
Manager (NNM), Tivoli, CA Unicenter, or IBM Director. The MIB data describes the
information being managed, reflects current and recent server status, and
provides server statistics.
SNMP v1, v2c, and v3 are supported with v3 selected by default (v1 and v2c are
disabled by default). SNMP “sets” can be selected or disabled (default). An IPMI-
specific trap called a Platform Event Trap, or PET, may also be generated. The
following SNMP MIBs are supported:
– SNMP-FRAMEWORK-MIB
– SNMP-USER-BASED-SM-MIB
– SNMP-MPD-MIB
– ENTITY-MIB
– SUN-PLATFORM-MIB
• Role-Based Administration
Different management users can be defined with corresponding roles and
responsibilities. Up to 10 user IDs can be created locally on the service processor
with each user ID consisting of a user name and the roles that are allowed. By
default, Administrator and Operator roles are defined. Authentication is carried
out against a local service processor database. Alternately, an LDAP client is
implemented in the ILOM service processor as well to allow authentication
against an LDAP server (LDAP groups must be mapped to service processor roles).
Up to 10 concurrent active sessions are supported on the service processor,
including serial, secure shell and web clients. User accounts can be authenticated
through LDAP, Radius, and Active Directory.
ILOM service processor and can be downloaded and executed on the management
console. Input/output of virtual devices is handled between ILOM on the server and
ILOM Remote Console on the web-based client management console.
CDROM, DVDROM
or .iso Image Remote Keyboard, Mouse and Storage
Emulated as USB Devices by ILOM
Figure 15. Remote keyboard, video, mouse, and storage (RKVMS) support in the
ILOM service processor allows full-featured remote management for Sun Fire
X4640 servers.
Sun Ops Center provides advanced management and monitoring features to the Sun
Fire X4640 server. The remote management interface discovers and presents the the
servers, making operations, detailed inventory, and status pages available to
37 Enterprise-Class Operating System and Management Software Sun Microsystems, Inc.
administrators. Servers can be discovered and organized into logical groups. Organizing
servers into groups also allows features such as OS deployment across multiple
systems.
Some of the functions available through Sun Ops Center software include operating
system provisioning, firmware updates (for both the BIOS and ILOM service processor
firmware), and health monitoring. In addition, Sun Ops Center includes a framework
allowing administrators to easily access inventory information, simplifying the task of
running jobs on multiple servers with server grouping functionality.
38 Conclusion Sun Microsystems, Inc.
Chapter 5
Conclusion
To curb the impact of sprawling datacenters, many organizations need x64 servers with
truly massive levels of scalability. Whether serving large database or high performance
computing applications, or consolidating multiple lower-powered servers, systems
must scale gracefully. The balanced design of Sun Fire X4640 server lets them scale in
processing power, memory, and I/O bandwidth, without arbitrary limitations.
The Sun Fire X4640 server features an innovative enterprise-class design that makes the
most of AMD’s Direct Connect Architecture. Based on innovative Six-Core AMD Opteron
processors, this server can supply a powerful building block to help scale, virtualize,
and consolidate datacenter infrastructure. Offering best-in-class performance,
remarkable density, and extensive system expandability, the Sun Fire X4640 server
makes the most of constrained datacenter resources. In addition, the server offers
extensive reliability, availability, and serviceability features along with improved energy
efficiency. Integrated Lights Out Management technology provides advanced
monitoring and management — at no extra cost.
Ultimately, even the most capable servers require enterprise-class operating systems
and tools. With a choice of the Solaris OS, Linux, Microsoft Windows, and VMware
virtualization software, these servers provide a wealth of popular options. The
Solaris OS in particular offers advanced and innovative features such as Solaris
Containers and DTrace that can help consolidate and tune enterprise deployments.
With powerful management tools such as Sun Ops Center, organizations can deploy
Sun Fire X4640 servers quickly with confidence, knowing that their choices can improve
their results as well as the bottom line.
39 Conclusion Sun Microsystems, Inc.
Sun Microsystems, Inc. 4150 Network Circle, Santa Clara, CA 95054 USA Phone 1-650-960-1300 or 1-800-555-9SUN (9786) Web sun.com
© 2009 Sun Microsystems, Inc. All rights reserved. Sun, Sun Microsystems, Java, JVM, Lustre, MySQL, NetBeans,Solaris, Sun Fire, Sun Ray, and StarOffice are trademarks or registered trademarks of Sun Microsystems, Inc. or
its subsidiaries in the United States and other countries. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the US and other countries. Products
bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc. AMD and Opteron are trademarks or registered trademarks of Advanced Micro Devices, Inc. Information subject to change
without notice. Printed in USA SunWIN #: 470898 11/09