Professional Documents
Culture Documents
Server Administration
ES-411
M9000
Student Guide
This product or document is protected by copyright and distributed under licenses restricting its use, copying, distribution, and
decompilation. No part of this product or document may be reproduced in any form by any means without prior written authorization of
Sun and its licensors, if any.
Third-party software, including font technology, is copyrighted and licensed from Sun suppliers.
Sun, Sun Microsystems, the Sun logo, Solaris, Sun Fire, Sun StorageTek, Java, Sunsolve, Sun4U, OpenBoot, Sun Enterprise, and Sun
StorEdge are trademarks or registered trademarks of Sun Microsystems, Inc. in the U.S. and other countries.
All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. in the U.S. and
other countries. Products bearing SPARC trademarks are based upon an architecture developed by Sun Microsystems, Inc.
UNIX is a registered trademark in the U.S. and other countries, exclusively licensed through X/Open Company, Ltd.
Federal Acquisitions: Commercial Software Government Users Subject to Standard License Terms and Conditions
Export Laws. Products, Services, and technical data delivered by Sun may be subject to U.S. export controls or the trade laws of other
countries. You will comply with all such laws and obtain all licenses to export, re-export, or import as may be required after delivery to
You. You will not export or re-export to entities on the most current U.S. export exclusions lists or to any country subject to U.S. embargo
or terrorist controls as specified in the U.S. export laws. You will not use or provide Products, Services, or technical data for nuclear, missile,
or chemical biological weaponry end uses.
THIS MANUAL IS DESIGNED TO SUPPORT AN INSTRUCTOR-LED TRAINING (ILT) COURSE AND IS INTENDED TO BE
USED FOR REFERENCE PURPOSES IN CONJUNCTION WITH THE ILT COURSE. THE MANUAL IS NOT A STANDALONE
TRAINING TOOL. USE OF THE MANUAL FOR SELF-STUDY WITHOUT CLASS ATTENDANCE IS NOT RECOMMENDED.
Please
Recycle
Copyright 2010 Sun Microsystems Inc., 4150 Network Circle, Santa Clara, California 95054, Etats-Unis. Tous droits rservs.
Ce produit ou document est protg par un copyright et distribu avec des licences qui en restreignent lutilisation, la copie, la distribution,
et la dcompilation. Aucune partie de ce produit ou document ne peut tre reproduite sous aucune forme, par quelque moyen que ce soit,
sans lautorisation pralable et crite de Sun et de ses bailleurs de licence, sil y en a.
Le logiciel dtenu par des tiers, et qui comprend la technologie relative aux polices de caractres, est protg par un copyright et licenci
par des fournisseurs de Sun.
Sun, Sun Microsystems, le logo Sun, Solaris, Sun Fire, Sun StorageTek, Java, Sunsolve, Sun4U, OpenBoot, Sun Enterprise, et Sun StorEdge
sont des marques de fabrique ou des marques dposes de Sun Microsystems, Inc. aux Etats-Unis et dans dautres pays.
Toutes les marques SPARC sont utilises sous licence sont des marques de fabrique ou des marques dposes de SPARC International, Inc.
aux Etats-Unis et dans dautres pays. Les produits portant les marques SPARC sont bass sur une architecture dveloppe par Sun
Microsystems, Inc.
UNIX est une marques dpose aux Etats-Unis et dans dautres pays et licencie exclusivement par X/Open Company, Ltd.
Lgislation en matire dexportations. Les Produits, Services et donnes techniques livrs par Sun peuvent tre soumis aux contrles
amricains sur les exportations, ou la lgislation commerciale dautres pays. Nous nous conformerons lensemble de ces textes et nous
obtiendrons toutes licences dexportation, de r-exportation ou dimportation susceptibles dtre requises aprs livraison Vous. Vous
nexporterez, ni ne r-exporterez en aucun cas des entits figurant sur les listes amricaines dinterdiction dexportation les plus courantes,
ni vers un quelconque pays soumis embargo par les Etats-Unis, ou des contrles anti-terroristes, comme prvu par la lgislation
amricaine en matire dexportations. Vous nutiliserez, ni ne fournirez les Produits, Services ou donnes techniques pour aucune utilisation
finale lie aux armes nuclaires, chimiques ou biologiques ou aux missiles.
CE MANUEL DE RFRENCE DOIT TRE UTILIS DANS LE CADRE DUN COURS DE FORMATION DIRIG PAR UN
INSTRUCTEUR (ILT). IL NE SAGIT PAS DUN OUTIL DE FORMATION INDPENDANT. NOUS VOUS DCONSEILLONS DE
LUTILISER DANS LE CADRE DUNE AUTO-FORMATION.
Please
Recycle
Table of Contents
v
Copyright July 2009 Sun Microsystems, Inc. All Rights Reserved. Sun Services, Revision C.1
Objectives ........................................................................................... 2-1
Additional Resources ........................................................................ 2-3
Operational Overview....................................................................... 2-4
The Jupiter Interconnect.................................................................... 2-5
Functionality .............................................................................. 2-7
CPU and Memory Board Operational Overview.......................... 2-8
SPARC64 VI and VII CPUs...................................................... 2-8
SPARC64 VI and VII Microprocessor Memory .................... 2-9
I/O Overview ................................................................................... 2-14
Mid-Range Server IOU........................................................... 2-14
High-End Server IOU ............................................................. 2-15
Board Nomenclature ....................................................................... 2-16
Extended System Controller Facility Unit (XSCFU) ................... 2-24
Global I2C Bus (Maintenance Bus) ....................................... 2-24
Service Processor Hardware Architecture ................................... 2-26
Service Processor Block Diagram ......................................... 2-26
Service Processor Component Breakdown ......................... 2-27
Exercise: Identifying the Architectural Components
of the MX000 Servers .................................................................... 2-29
Task ........................................................................................... 2-29
Exercise Summary............................................................................ 2-31
Exercise Solutions ............................................................................ 2-32
Task ........................................................................................... 2-32
Service Processor Architecture and
Configuration ................................................................................... 3-1
Objectives ........................................................................................... 3-1
Additional Resources ........................................................................ 3-3
Extended System Control Facility Functions................................. 3-5
Service Processor Functionality .............................................. 3-5
M8000 and M9000 Servers .................................................... 3-10
M9000 Server With Expansion Cabinet ............................... 3-11
Service Processor Comparison Summary .......................... 3-12
Serial Management Port Connections................................. 3-13
Configuring the Service Processor ................................................ 3-14
Gathering Required Information .......................................... 3-14
Initial Login to the Service Processor .................................. 3-15
Adding or removing a User Account.................................. 3-17
The adduser Command......................................................... 3-17
The deleteuser Command .................................................. 3-18
The password Command ...................................................... 3-19
Managing Server Administrative Privileges................................ 3-21
Access Control Privileges ...................................................... 3-21
The setprivileges Command ........................................... 3-23
The showuser Command ..................................................... 3-25
Setting the Service Processor Time...................................... 3-28
vii
Copyright July 2009 Sun Microsystems, Inc. All Rights Reserved. Sun Services, Revision C.1
Objectives ........................................................................................... 4-1
Additional Resources ........................................................................ 4-3
Managing Sun SPARC Enterprise Servers ..................................... 4-5
Gathering Additional Service Processor Configuration
Information ............................................................................. 4-5
Managing XSCF User Accounts....................................................... 4-6
The setpasswordpolicy Command.................................... 4-7
The showpasswordpolicy Command .................................. 4-8
The disableuser Command............................................... 4-10
The enableuser Command ................................................. 4-11
The setautologout Command .......................................... 4-12
The showautologout Command ........................................ 4-13
The who Command ................................................................ 4-14
Managing XSCF User Accounts With LDAP............................... 4-15
Commands Used to Configure LDAP ................................. 4-15
General LDAP Setup Procedure ........................................... 4-16
The setlookup Command .................................................... 4-16
The showlookup Command .................................................. 4-17
The setldap Command......................................................... 4-18
The showldap Command ...................................................... 4-19
Managing Additional Network Settings ...................................... 4-22
The setsmtp and showsmtp Commands ........................... 4-23
Sample Command Output.................................................... 4-25
The setemailreport Command ........................................ 4-26
The showemailreport Command ...................................... 4-27
Configuring the Domain Name System ....................................... 4-28
Commands Used to Configure DNS.................................... 4-28
The setnameserver Command .......................................... 4-29
The shownameserver Command ......................................... 4-30
Configuring SNMP .......................................................................... 4-31
SNMP Traps............................................................................. 4-32
Setup Information ................................................................... 4-33
Commands Used to Configure SNMP................................. 4-34
The setsnmp Command......................................................... 4-34
The showsnmp Command ..................................................... 4-36
The setsnmpusm Command ................................................. 4-37
The setsnmpvacm Command................................................ 4-38
The setsunmc Command ...................................................... 4-40
The showsunmc Command .................................................... 4-42
Managing the Server Environment ............................................... 4-43
The setaltitude Command............................................... 4-44
The showaltitude Command............................................. 4-47
The showenvironment Command ...................................... 4-48
The setlocator Command ................................................. 4-50
The showlocator Command............................................... 4-51
The setpowerupdelay Commands .................................... 4-52
ix
Copyright July 2009 Sun Microsystems, Inc. All Rights Reserved. Sun Services, Revision C.1
Domain Configuration on the High-End Servers ................ 5-7
Domain Configuration Unit (DCU)....................................... 5-8
Domain Configuration .......................................................... 5-14
Configuring Static Domains ........................................................... 5-15
The showboards Command................................................ 5-16
The setupfru Command ..................................................... 5-17
The setdcl Command.......................................................... 5-19
The addboard Command ..................................................... 5-22
The deleteboard Command............................................... 5-24
The moveboard Command ................................................... 5-26
The setdomainmode Command .......................................... 5-27
The setdomparam Command............................................... 5-29
Controlling Power to the Domain ................................................. 5-30
The poweron Command......................................................... 5-30
The poweroff Command ...................................................... 5-31
Accessing the Domain Console...................................................... 5-32
The console Command......................................................... 5-32
The showconsolepath Command ....................................... 5-34
Displaying the Domain Devices and Status................................. 5-35
The showdevices Command................................................ 5-35
The showdomainstatus Command..................................... 5-37
Resetting the Domain ...................................................................... 5-39
The sendbreak Command .................................................... 5-39
The reset Command ............................................................. 5-39
The OBP Device Tree....................................................................... 5-41
M8000 Uni-mode XSB 00-0 OBP Device Tre ...................... 5-67
Processor Device Mapping ............................................................. 5-73
Decoding Processor Locations on the MX000 Servers....... 5-73
Decoding I/O Card Locations on the High-End Servers. 5-82
External I/O Expansion Unit ................................................ 5-86
The Solaris OS................................................................................... 5-88
Configuring the Solaris OS in a New Domain............................. 5-89
Configuring Services for the Solaris 10 OS.......................... 5-89
Finishing the Installation ....................................................... 5-90
Exercise: Administering and Configuring the Domain.............. 5-91
Task 1 Building a Domain with a uni-mode XSB........... 5-92
Task 2 Building a Domain with a quad-mode XSB........ 5-94
Task 3 Building a Domain with two quad-mode XSBs . 5-96
Task 4 Sending a break signal to the OS.......................... 5-98
Exercise Summary............................................................................ 5-99
Exercise Solutions .......................................................................... 5-100
Task 1 Building a Domain with a uni-mode XSB......... 5-101
Task 2 Building a Domain with a quad-mode XSB...... 5-104
Task 3 Building a Domain with two quad-mode XSBs 5-106
Task 4 Sending a break signal to the OS........................ 5-108
xi
Copyright July 2009 Sun Microsystems, Inc. All Rights Reserved. Sun Services, Revision C.1
Archived Data Types................................................................ 7-5
Log Archiving Functional Overview .................................... 7-6
Configuring Log Archiving .............................................................. 7-7
The setarchiving Command................................................ 7-7
The showarchiving Command ............................................ 7-9
Log Archiving Task Summary ............................................. 7-10
Configure the Log Archive Host .......................................... 7-10
Auditing ............................................................................................ 7-12
Audit Records.......................................................................... 7-12
Audit Trails .............................................................................. 7-12
Audit Events ........................................................................... 7-13
Audit Classes .......................................................................... 7-14
Audit Policy ............................................................................. 7-14
Performing Audit Tasks.................................................................. 7-15
The setaudit Command ...................................................... 7-15
The showaudit Command ................................................... 7-18
The viewaudit Command .................................................... 7-20
Audit Task Summary ............................................................ 7-23
The snapshot Command ............................................................... 7-25
Options and Parameters........................................................ 7-26
Operational Overview........................................................... 7-28
Modes of Operation ............................................................... 7-29
Sample Command Output.................................................... 7-30
Viewing Raw Snapshot Data................................................ 7-31
Viewing and Monitoring Logs ....................................................... 7-32
The showlogs Command ...................................................... 7-32
The showmonitorlog Command ........................................ 7-36
Fault Management Architecture.................................................... 7-37
Service Processor Implementation ....................................... 7-37
The fmd Daemon .................................................................... 7-38
The fmadm Command ............................................................ 7-39
The fmdump Command.......................................................... 7-40
The fmstat Command.......................................................... 7-46
Collecting Fault Data .............................................................. 7-48
FRU Replacement Guidelines ........................................................ 7-49
Backing up service processor information ......................... 7-50
Restoring service processor configuration information ... 7-52
Removing service processor information........................... 7-54
Collecting Data With Sun Explorer Software .............................. 7-56
Installing and Running the Sun Explorer Utility .............. 7-57
Viewing a Sun Explorer Capture......................................... 7-58
Gathering Escalation Information ................................................. 7-60
Providing General Information ............................................ 7-60
Providing Detailed Information ........................................... 7-60
Exercise: Collecting Data and Viewing Logs ............................... 7-61
Task 1 Configuring Log Archiving................................... 7-62
xiii
Copyright July 2009 Sun Microsystems, Inc. All Rights Reserved. Sun Services, Revision C.1
Module 1
Objectives
Upon completion of this module, you should be able to:
Compare and contrast the five Sun SPARC Enterprise MX000 server
models
Differentiate hardware features of the Sun SPARC Enterprise M3000
entry-level server
Differentiate hardware features across the Sun SPARC Enterprise
MX000 mid-range server product line
Differentiate hardware features across the Sun SPARC Enterprise
MX000 high-end server product line
List the common features of all five Sun SPARC Enterprise MX000
server models
1-1
Copyright July 2009 Sun Microsystems, Inc. All Rights Reserved. Sun Services, Revision C.1
Objectives
Relevance
Additional Resources
Server Technologies
Each of the MX000 servers merges mainframe-like technologies for high
reliability and the associated know-how accumulated over time with the
high-speed technologies of supercomputers and the flexibility of UNIX
server development.
Resource Flexibility
Each M3000, M4000, M5000, M8000, M9000, and M9000 with Expansion
Cabinet server contains SPARC 64 VI or SPARC 64 VII
microprocessors, which contain a multicore architecture.
Error Isolation
Capability Description
Server Features
The M8000, M9000 and M9000 with Expansion Cabinet servers hardware
configuration includes the following components:
Centerplane
CPU/memory board(s) (CMU)
Two or Four processors per board
Sixteen or Thirty Two dimms per board
I/O board(s)
Eight PCI-E card slots
Eight PCI cassettes
Four HDD drives
Tape drive unit (optional), and DVD drive
Jupiter System bus
Symmetric multiprocessing (SMP), where any processor in the same
domain can connect to a single shared main memory
Two eXtended System Control Facility Unit (XSCFU_B) boards
known as the service processors, One Active, One Standby
Operator panel (OPNL)
Redundant power and cooling
Hot FRU removal/replacement capability for fan trays, power
supplies, service processors, CMUs, IOUs, DC-DC convertors
(DDCs)
I/O expansion with the Sun External I/O Expansion Unit
Note In many cases, field-replaceable units for the M8000, M9000, and
M9000 with Expansion Cabinet servers are designed with different size
dimensions when compared with the field-replaceable units in the M3000,
M4000 and M5000 servers.
Table 1-2 Features of the Sun SPARC Enterprise M8000, M9000, and M9000 With
Expansion Cabinet Servers
M9000 With
Components M8000 M9000 Expansion
Cabinet
Table 1-2 Features of the Sun SPARC Enterprise M8000, M9000, and M9000 With
Expansion Cabinet Servers (Continued)
M9000 With
Components M8000 M9000 Expansion
Cabinet
Service Two XSCFU_B Two XSCFU_B Two XSCFU_B
Processors Two XSCFU_C
Power supplies 9 units (no 15 units (no 30 units (no
expansion expansion expansion
cabinet) N+1 cabinet) N+1 cabinet) N+1
Three power Five Power Ten power
cords cords cords
Table 1-2 Features of the Sun SPARC Enterprise M8000, M9000, and M9000 With
Expansion Cabinet Servers (Continued)
M9000 With
Components M8000 M9000 Expansion
Cabinet
Server 70.9 x 29.5 x 70.9 x 33.5 x 70.9 x 65.9 x
dimensions: 49.6 inches 49.6 inches 49.6 inches
height x width
x depth 180 x 75 x 180 x 85 x 180 x 167.4 x
126 mm 126 mm 126 mm
Clearance and
service area 31.5 inches 31.5 inches 31.5 inches
front 800 mm 800 mm 800 mm
Power Supports
Expansion Single Phase
Cabinet Dual power 70.9 x 12.5 x 70.9 x 12.5 x 70.9 x 12.5 x
Dimensions: feed and 49 inches 49 inches 49 inches
height x width Three phase
x depth power (Y(Star) 180 x 31.7 x 180 x 31.7 x 180 x 31.7 x
(Air flow is and Delta) 124.4 mm 124.4 mm 124.4 mm
from front to
rear)
Power
Expansion 770 lb 770 lb 770 lb
Cabinet 350 kg 350 kg 350 kg
weight
Term Description
Term Description
CMU board High-end server board containing either 2 or 4
processors and 32 DIMM slots.
Each CPU consists of two(SPARC64VI) or
four(SPARC64 VII) cores.
Each core is dual threaded.
I/O Unit (IOU) board The IOU contains PCI card slots.
The mid-range server IOU provides 5 card slots and 2
network interfaces.
The high-end server IOU provides 8 card slots and 4
HDD drives.
Physical System Board (PSB) The mid-range server PSB consists of at least one
set CPUM, one MEMB, and one IOU. A PSB set can be
partially populated or fully populated.
The high-end server PSB consists of at least one CMU
and one IOU. The CMU board can be partially
populated or fully populated.
Domain Component List Each domain has its own Domain Component List
(DCL) and Logical System (DCL).
Board (LSB)
Each of these DCLs contain 16 Logical System Board
(LSB) numbers ranging from 0 to 15.
Before any XSB can be added to any one domain, it
must first have an associated LSB number in the
corresponding DCL.
The M8000 has four PSB (PSB# 00, 01, 02, 03) sets that contain the
following:
PSB#00
CMU#0 and IUO#0
PSB#1
CMU#1 and IOU#1
PSB#2
CMU#2 and IOU#2
PSB#3
CMU#3 and IOU#3
The M9000-32 has eight PSB sets number 0 through 7, the M9000-64 has
sixteen PSB sets numbered 0 through 15.
In the MX000 mid-range servers, the CPUs are located on CPUM boards
and the memory is found on MEMB boards.
In the MX000 high-end servers, the CPUs and memory are located on the
same physical board, the CMU.
All CPUs in the MX000 servers use the SPARC64 VI or SPARC64 VII
high-performance multiprocessor. The microprocessor implements an
instruction retry function allowing an operation to continue when an
error has been detected.
Multithreading Technology
The CPUs have two or four physical cores and each core has two strands
with VMT structures. This allows threads to run in parallel. The two
strands that belong to the same physical core share most of the physical
resources, while the physical cores do not share physical resources except
the Level 2 (L2) cache and system interface.
Features Summary
The DDR2 memory dimms are 1-Gbyte, 2-Gbyte, or 4-Gbyte with the
following features:
Error Correcting Code (ECC) protection
Recovery from memory chip failures
Memory mirroring (optional)
Extended ECC memory support
Memory Mirroring
The memory mirror mode is supported for every pair of memory buses in
a CMU. If an error occurs in one bus, the memory mirror mode enables
continued operation using the other, non-defective bus. This feature is
optional and must be enabled by the system administrator.
Extended ECC, or Advanced ECC, is a technology that was used for the
NASA pathfinder mission to Mars. Extended ECC memory has the ability
to correct multibit memory errors and, in doing so, increases system
availability considerably.
Extended ECC operations are essentially the same as RAID for disk
subsystems. When writing data to the DIMM, a duplicate set of data in
the form of a checksum is written to another part of the memory
subsystem. If a memory failure occurs, then the data is immediately
recovered by re-calculating the data from the checksum information. This
procedure allows the system to mask single bit errors that standard ECC
memory can correct but also 2-, 3-, and 4-bit errors. In some cases, even a
whole DRAM chip failure simply results in a correctable error (CE).
I/O Features
The entry-level M3000 provides I/O through the motherboard and four
PCI-E I/O slots.
The M4000 and M5000 servers upper four slots of the IOU support PCI-E
cards while the lower slot supports a single PCI-X card. Each IOU also
supports two Gigabit ethernet ports.
The M8000 and M9000 servers IOU has eight PCI-E card slots. Each IOU
also supports four HDD drives.
The MX000 mid-range and high-end servers support a mix of PCI-E and
PCI-X cards. The industry-standard PCI cards used are not hot-pluggable,
so the PCI cards are installed into a hot-pluggable PCI Cassette unit.
Max Number of
Server PCI Slots PCI-E PCI-X
IOUs
M3000 0 4 4 0
M4000 1 5 4 1
M5000 2 10 8 2
M8000 4 32 32 0
M9000 8 64 64 0
M9000 with 16 128 128 0
Expansion
Cabinet
Note The mid-range and high-end servers (but not entry-level) support
a Sun External I/O Expansion Unit that provides additional PCI slots.
Additional PCI-E or PCI-X slots can be accessed by installing a PCIB-EX
card into a PCI-E slot and connecting an external I/O Expansion Unit.
This I/O expansion unit can be ordered with PCI-E and/or PCI-X slots.
The MX000 servers currently support the disk and tape storage devices
listed in Table 1-5.
The Sun External I/O Expansion Unit provides a host system with
additional slots for PCI cards. The single I/O boat configuration provides
six additional PCI-X or PCI-E slots. The dual I/O boat configuration
provides twelve additional PCI-X or PCI-E slots.
The I/O Expansion Unit chassis includes the centerplane and two
permanently attached internal AC cables.
Self-paced course This course only presents information about the Sun
External I/O Expansion Unit that is specific to the MX000 server models.
For additional training about the I/O Expansion Unit, the following Web-
based course is available:
WET-5393: Sun External I/O Expansion Unit Installation and Administration.
XSCF Features
The MX000 servers all use the XSCF to provide system monitoring and
control. This hardware component is called the Extended System Control
Facility unit (XSCFU).
While input power is being supplied to the server, the XSCF firmware
constantly monitors and manages the server even if the power to domains
is turned off.
RAS Features
The RAS features designed into the MX000 servers are listed in Table 1-6.
Feature Description
Feature Description
Availability Redundant components - A strategy used to
improve availability. When one component fails, the
redundant component(s) ensure that services that
were delivered by the failed component continue to
be available. Power supplies and fans are examples
of redundant components in both systems.
Predictive failure analysis - Ability to maintain a log
and threshold of errors and take proactive recovery
actions that minimize the likelihood of a system
outage.
Limited single point of reboot failure (hardware) -
Limited single hardware point of failure prevents
any domain from rebooting the OS.
Fault isolation of the crossbar interconnect
Extended ECC memory support - Recovers from
memory chip failures. Dynamic Random Access
Memory (DRAM) failures result in a correctable
error.
Dual power grid capable - The alternating current
(AC) power subsystem is completely duplicated,
which allows a redundant connection to a different
power grid (not available on the M4000 or M5000
servers - although they do offer power supply
redundancy).
Automated email notification system - Ability of the
server to detect a problem and to send event
notification through email, the SNMP trap, the BUI
interface, or to remote system monitors.
Enhanced retry - Support of an enhanced retry and
degradation function for the detected faults.
Automatic system reboot - Shortening the system
downtime by using automatic system reboot.
XSCF collection of fault information -
The XSCF collects information about
faults, log files, configurations, and environmental
conditions.
Feature Description
Serviceability First fault isolation - Based on in-line error detection
with the capability to detect errors within FRU
boundaries.
Error logging - Ability to immediately log errors.
Multistage alerts and reporting - Errors are logged
and reported to higher software levels and made
available to system management and remote
monitoring.
Directive maintenance - Ability to automatically
direct the maintenance activities of the service
personnel.
Concurrent maintenance (limited) - Ability to
perform maintenance activities while the system is
in operation.
Machine readable FRU identification (FRU-ID) -
Ability to electronically read FRU-ID information,
such as part numbers and serial numbers.
Application Specific Integrated Circuits (ASIC) and
processor built-in self-test (BIST) - Provide
chip (ASIC) and processor tests.
Dynamic system domains, including domain
isolation - Provide multiple domains. Each domain
is capable of running different OS instances and each
domain operates independently from other
domains.
System and FRU-level fault indicators, or light-
emitting diodes (LEDs) - Visually identify the
system status including FRU operational status and
service required.
XSCF firmware - Service processor firmware used
for platform administration, platform logging, and
service-related actions.
DR has three basic functions: add, delete, and move. These functions
allow you to:
Add Domain Configurable Units (DCUs) without stopping the OS to
manage domain resources (addboard).
Delete Domain Configurable Units (DCUs) without stopping the OS
to manage domain resources (deleteboard).
Move Domain Configurable Units (DCUs) without stopping the OS
to manage domain resources (moveboard).
Figure 1-1 Sun SPARC Enterprise M9000 Server With Power Cabinet
Table 1-7 on page 1-28 identifies the various hardware components found
in the M9000 server.
Component M9000
Component M9000
Exercise Summary
Exercise Solutions
This section contains solutions for the exercise.
Exercise Summary
Module 2
Objectives
Upon completion of this module, you should be able to:
Compare and contrast the Jupiter bus interconnect with bus
interconnects for Sun's predecessor mid-range and high-level server
product lines
Document the architectural similarities and differences across the
MX000 server product line
Classify and label architectural components across the entire MX000
server product line
Differentiate between uni-mode XSBs and quad-mode XSBs
Identify the terms used in nomenclature in the MX000 server line
Relevance
Additional Resources
Operational Overview
The MX000 server architecture builds and improves upon the success of
the Sun SPARC Enterprise family of servers by using a new CPU and bus
architecture. This enhanced architecture includes:
New system bus architecture based on the Jupiter bus interconnect
also called the Jupiter link
An enhanced CPU architecture running at speeds of 2 GHz and
greater
Multicore, multithreaded CPUs
SPARC64 VI Dual Core, Dual Threaded Processors
SPARC64 VII Quad Core, Dual Threaded Processors
A new service processor (XSCFU) with a fault management daemon
In the high-end servers, both the CPUs and memory reside on a CPU
Memory unit (CMU). The CMU board is inserted into the chassis and
plugs into a centerplane.
Figure 2-1 on page 2-7 shows the SPARC64 VI and VII processor chip
block diagram. Each core has its own floating point unit (FPU) and data
cache (D$). They do share the same L2 cache, which can be 5 or 6 Mbytes.
Note The M3000 offers the SPARC64 VII CPU with two or four cores.
The M4000, M5000, M8000, and M9000 servers offer the SPARC64 VII
CPU with four cores. Each core runs two threads.
Note The SPARC64 VII CPU cores all share the L2 cache and each core
has 64KB of I-cache and 64 KB of D-cache.
DIMMs are installed in two groups of four on the M3000. The minimum
configuration of memory for the M3000 is 4 GB.
You cannot mix larger DIMM sizes with smaller DIMM sizes in the same
group or set. The CMU board provides 16 A banks of memory and 16
B banks of memory. The largest size memory dimms are installed in the
A banks.
Memory Mirroring
The entry level M3000, the mid-range servers, and high-end servers
support memory mirroring. Memory mirroring operates by pairing
DIMMs together for both reads and writes. Mirroring divides the
available memory in half. Memory mirroring is turned off by default on
all systems.
On writes, the data and ECC are written to both halves of the pair. On
reads, data is read from both halves and the ECC is compared. Errors
during memory mirroring are handled as follows:
If both copies of the pair have correctable ECC and have matching
data, the data is returned.
If one copy has uncorrectable ECC and the other has correctable
ECC, the data is returned.
If both copies have uncorrectable ECC errors, an error data packet is
returned.
If both copies have correctable ECC, but the data does not match, an
error data packet is returned.
In the high-end models configured in quad-mode XSB mode, you get half
of MAC0 and MAC2 for XSB-0. This does not allow the memory mirror
mode because you can only mirror within a MAC.
IOU PCI-E slots #0 and #4 support the IOU Device Mounting card A
(IOU A card). When installed, this PCI-E card provides two external
Gigabit ethernet ports for use by a domain. The IOU A card installed
in PCI-E slot #0 provides access to the HDD drives in HDD slots #0
and #1. The IOU A card installed in PCI-E slot #4 provides access to
the HDD drives in HDD slots #2 and #3.
IOU PCI-E slots #2 and #6 also support IOU A cards. When installed,
these cards act as an attachment point for the DVD and DAT.
Any one domain supports a maximum of six IOU A cards.
Board Nomenclature
The boards in the MX000 servers are referenced with new terminology.
Before configuring the system, you should be familiar with the following
terms:
Physical System Board (PSB) set Consists of CPU, memory, and I/O
Specific boards are associated with specific with each PSB set
eXtended System Board (XSB)
Is the result of using your PSB in uni-mode (XX-0) with all of
the associated CPUs, memory boards, and IO being assigned to
the XSB
Is the result of using your PSB in quad-mode (XX-0, XX-1, XX-2,
XX-3) with the associated CPUs, memory boards, and IO being
divided among as many as four extents
The numbering scheme is based on the PSB number coming
first, followed by a dash and then the extent number.
Logical System Board (LSB) number To create domains, you must
first assign an LSB number to an XSB
There are 16 LSB numbers for every domain.
The LSB numbers range from 0 to 15.
Figure 2-3 High-End Server uni-mode XSB and quad-mode XSB Block
Diagram
The I2C bus monitors the server voltages, temperatures, and fan speeds.
In Figure 2-4 on page 2-17 you can see how the maintenance buses
interface within the system.
Component Details
Component Details
FMEM Spansion S29JL064H70TFI000
flash memory 8 Mbytes
64 Megabits (Mbits) (x2)
Uses two banks of memory, one for backup
Operating voltage: 3v
Maintenance Acts as the communication mechanism for various
bus components and provides:
I2C master control
JTAG controller functionality
Power supply monitoring
Interrupt handling
Reset control
Ports The following ports are provided:
USB 1.1 (x1)
10/100Base-T (x2)
Serial port (x1)
Uninterruptable Power Control (UPC) (x2)
RCI (x1) not used by Sun
Clock Epson Q414574B1000102
Entry-level and Mid-range servers only
High-accuracy crystal real-time clock
Automatic leap year adjustment
Clock is provided by the Active Centerplane on
the M8000
Clock is provided by the Clock Board on the
M9000s
Preparation
No special preparation is needed for this lab.
Task
Answer the following questions:
1. What are the functions of the system controller ASIC?
_____________________________________________________________
_____________________________________________________________
_____________________________________________________________
_____________________________________________________________
_____________________________________________________________
_____________________________________________________________
_____________________________________________________________
_____________________________________________________________
_____________________________________________________________
_____________________________________________________________
Exercise Summary
Exercise Solutions
This section provides solutions for the exercise.
Task
Answer the following questions:
1. What are the functions of the system controller ASIC?
The system controller is an ASIC that provides an interface among the memory
access controllers (MACs), the CPUs, and the I/O controllers.
2. Which type of interconnect bus is used in the MX000 servers?
The CPUM is a fru that consists of two CPUs and resides in the
mid-range servers. A CMU is a CPU/Memory unit that consists of 2 or 4 CPUs
and 16 or 32 DIMM slots and resides in the high-end servers.
4. Which servers support both PCI-X and PCI-Express cards without
the use of an IO-Box?
A uni-mode XSB (00-0) is a PSB set with all of the components assigned to it. A
quad-mode XSB is a PSB set with its components divided into four sections (00-
0, 00-1, 00-2, 00-3). The M3000 does not support quad-mode. The component
division is different on the mid-range and the high-end servers.
Objectives
Upon completion of this module, you should be able to:
Document the functions of the service processor
Compare and contrast the differences between the service processors
of the entry-level, mid-range, and high-end servers
List the features of the External Administration Network, Domain to
Service Processor Communication Protocol Network, and the
Internal Service Network
Compare the available physical network configurations
Perform an initial configuration of the service processor
Update the service processor firmware
3-1
Copyright July 2009 Sun Microsystems, Inc. All Rights Reserved. Sun Services, Revision C.1
Objectives
Relevance
Additional Resources
Function Description
Function Description
Figure 3-1 shows the front view of the service processors (XSCFU_B and
XSCFU_C boards) used in the high-end servers. The XSCFU_B ports are
the External LAN management ports, the USB port, the Serial
Management ports, and the RCI port.
Figure 3-2 illustrates the cabling between the XSCFU_B and XSCFU_C
boards within the M9000 server with Expansion Cabinet.
Data Cable
Figure 3-2 XSCFU_C Expansion Cabling for the M9000 Server With
Expansion Cabinet
M9000 With
M3000, M4000 and M8000 and M9000
Feature Expansion
M5000 Servers Servers
Cabinet Servers
The proper pinouts for both the MX000 serial management port and
terminal concentrator are provided below:
Sun System RJ45 annex TC RJ45
Serial Port port
=========================================================
Pin Desc. Color Color Desc. Pin
=========================================================
1 RTS White/Orange <----> White/Orange CTS 8
2 DTR Orange <----> Orange DSR 7
3 TXD White/Green <----> White/Green RXD 5
4 GND Blue <----> Blue GND 4
5 GND White/Blue <----> White/Blue GND 6
6 RXD Green <----> Green TXD 3
7 DSR White/Brown <----> White/Brown DTR 2
8 CTS Brown <----> Brown RTS 1
Note The following procedure can also be used if the login password is
lost.
Upon login, create a new user account with a password and platadm and
useradm privileges. Log out and then log back in as the new user to start
the service processor configuration process.
Note Once any new user account has been created, the admin account is
disabled.
Note The UIDs 65534 and 65535 are reserved and cannot be used.
Note The following accounts are reserved for the service processor and
cannot be used: root, bin, daemon, adm, operator, nobody,
sshd, rpc, rpcuser, ldap, apache, ntp, admin, and
default.
The following example creates a user named bill with a UID of 250.
XSCF> adduser -u 250 bill
Note If a new user is created and a UID is not specified, the first new
users id will be 100. The maximum length of a user name is limited to 31
characters.
If the user is currently connected into the service processor, the users
connection is terminated at once. The users account is removed from the
system and the user cannot log back in.
Note Any XSCF user can set his or her own password. Only a user with
useradm privileges can set another users password.
The following example enables the password for the user darcy until
February 25, 2011:
XSCF> password -e 2011-02-25 darcy
The following example sets the password to lock 10 days after the
password expires for the user rob:
XSCF> password -i 10 rob
The following example changes the password for the user frank:
XSCF> password frank
New XSCF password:
Retype new XSCF password:
Note The service processor will give you a warning message if you do
not adhere to the password policy.
Note Privileges are preset in the XSCF software. The existing privileges
cannot be changed, nor can additional privileges be added.
Privilege Capabilities
none None. When the local privilege for a user is set to none,
that user has no privileges, even if privileges for that
user are defined in LDAP.
useradm Create, delete, disable, and enable user accounts.
Change a users password and password properties.
Change a users privileges.
View all platform states except audit state and trail.
Privilege Capabilities
domainmgr Perform domain power operations.
View all states of the hardware assigned to the
domain(s) on which this privilege is held.
View all states of the domain(s) on which this
privilege is held.
Domain-specific Privileges
In this example, domainadm privileges are being assigned for the user
gary to domains 1, 2, 3, 4, 6, and 9:
XSCF> setprivileges gary domainadm@1-4,6,9
In this example, all privileges are removed for the user bill.
XSCF> setprivileges bill none
Note If logged into the XSCF as root, you do not need to use the -l
option to view users. Otherwise, a permission denied message will be
returned.
Note Pressing the tab key twice in rapid succession will allow you to
display all of the available commands. Typing the word show and then
the tab key will display all of the commands that begin with show.
The tab key can also be used for command completion.
Note Using the up arrow key, you can scroll back through the
commands that have been entered. Using the down arrow key takes
you back through the commands most recently entered. Using the right
arrow and left arrow keys allows for editing of the displayed
command.
To set the time zone for the service processor, complete the following
steps:
1. Log in to the service processor using an account with platadm or
fieldeng privileges.
2. Display the current time zone:
XSCF> showtimezone -c tz
US/Eastern
3. View the available list of time zones:
XSCF> settimezone -c settz -a
Africa/Abidjan
Africa/Accra
Africa/Addis_Ababa
Africa/Algiers
Africa/Asmera
Africa/Bamako
...
4. Set the desired time zone to any of the allowed POSIX time zone
names, as shown in step 3. For example:
Configuring NTP
The network time protocol (NTP) is an Internet protocol used to
synchronize computer clocks to a known source.
The rebootxscf command allows the user to manually reboot the XSCF.
This command requires either platadm or fieldeng privileges to execute.
The following example resets the XSCF and automatically replies with
yes without displaying the prompt.
XSCF> rebootxscf -q -y
Note The MX000 servers are shipped without any of the networks
configured.
Network Interfaces
Note The XSCF0 interfaces apply to the entry-level M3000, the mid-
range servers, and the high-end servers. The XSCF1 and XSCF-LAN
interfaces apply only to the high-end servers.
Network Parameters
The XSCF External Administration Network parameters are for:
The Active service processor interfaces (xscf#0-lan#0 & xscf#0-
lan#1) (All MX000 servers)
The Standby service processor interfaces (xscf#1-lan#0 &
xscf#1-lan#1) (M8000 and M9000 servers only)
Netmasks
Network routes
Gateway addresses
xscf#0 hostname :
xscf#1 hostname :
DNS domain name :
nameserver :
nameserver :
interface :xscf#0-lan#0
status :down
IP address :10.6.15.21
netmask :255.0.0.0
route :-n 10.0.0.0 -m 255.0.0.0
route :-n 0.0.0.0 -m 0.0.0.0 -g 10.0.0.5
interface :xscf#0-lan#1
status :down
IP address :192.1.1.21
netmask :255.255.255.0
route :
interface :xscf#0-if
status :down
IP address :
netmask :
interface :lan#0
status :down
IP address :10.6.15.25
netmask :255.0.0.0
route :
interface :xscf#1-lan#0
status :down
IP address :10.6.15.22
netmask :255.0.0.0
route :-n 10.0.0.0 -m 255.0.0.0
route :-n 0.0.0.0 -m 0.0.0.0 -g 10.0.0.5
interface :xscf#1-lan#1
status :down
IP address :192.1.1.22
netmask :255.255.255.0
route :
interface :xscf#1-if
status :down
IP address :
netmask :
interface :lan#1
status :down
IP address :192.1.1.25
netmask :
route :
Continue? [y|n] :y
Please reset the XSCF by rebootxscf to apply the
network settings. Please confirm that the settings have
been applied by executing showhostname, shownetwork,
showroute and shownameserver after rebooting the XSCF.
XSCF> rebootxscf -y
The XSCF will be reset. Continue? [y|n] :y
The following example sets the host name for xscf#0 to godzilla-sp0.
XSCF> sethostname xscf#0 godzilla-sp0
The following example sets the host name for xscf#1 to godzilla-sp1.
XSCF> sethostname xscf#1 godzilla-sp1
The following example sets the domain name to sun.com for xscf#0 and
xscf#1.
XSCF> sethostname -d sun.com
Note Use the shownetwork with the -a option to see all of your
network interfaces.
The following example adds the route 10.7.1.0 and netmask 255.255.0.0 for
xscf#0-lan#0.
XSCF> setroute -c add -n 10.7.1.0 -m 255.255.0.0
xscf#0-lan#0
Commands Used
192.168.224.1 192.168.224.2
XSCFU Domain 0
192.168.224.4
Domain 2
DSCP Link
192.168.224.13
Domain 11
192.168.224.20
Domain 18
Location Address
---------- ---------
XSCF 192.168.224.1
Domain #00 192.168.224.2
Domain #01 192.168.224.3
Domain #02 192.168.224.4
Domain #03 192.168.224.5
Domain #04 192.168.224.6
Domain #05 192.168.224.7
Domain #06 192.168.224.8
Domain #07 192.168.224.9
Domain #08 192.168.224.10
Domain #09 192.168.224.11
Domain #10 192.168.224.12
Domain #11 192.168.224.13
Domain #12 192.168.224.14
Domain #13 192.168.224.15
Domain #14 192.168.224.16
Domain #15 192.168.224.17
Commands Used
Caution The DSCP network should only be configured when there are
no domains running. If a change is made to the DSCP network while a
domain is active, the domain must be rebooted before the service
processor can communicate with it. Specific Solaris services must be
enabled for DSCP to function.
The XSCF ISN allows the active service processor to exchange pertinent
system management information with the standby service processor so
that if a failure occurs, the standby service processor can take over as the
new active service processor without any interruption of domain service.
Network Interfaces
The XSCF Internal Service Network consists of one ISN interface for each
service processor, and provides for a private internal communication. The
interfaces are seen in Table 3-5.
Commands Used
Note Enabling telnet does not require that the XSCF be reset with the
rebootxscf command, but disabling telnet does. Refer to the man
pages.
3. Enable ssh:
XSCF> setssh -c enable
Continue? [y|n] :y
Please reset the XSCF by rebootxscf to apply the ssh
settings.
Note Enabling ssh requires that the XSCF be reset with the rebootxscf
command, but disabling ssh does not.
4. Enable https:
XSCF> sethttps -c selfsign US MA Burlington SLS OPL
ssp0 root@ssp0
Enter passphrase: cangetin
Verifying - Enter passphrase: cangetin
XSCF> sethttps -c enable
Continue? [y/n] : y
Please reset the XSCF by rebootxscf to apply the https
settings.
XSCF> rebootxscf -y
Note Enabling https requires that the XSCF be reset with the
rebootxscf command, but disabling https does not. Refer to the man
pages.
interface :xscf#0-lan#0
status :up
IP address :10.6.15.41
netmask :255.0.0.0
route :
interface :xscf#0-lan#1
status :up
IP address :192.1.1.41
netmask :255.255.255.0
route :
Continue? [y|n] :y
The XCP package is provided as a compressed file. The file name will be
representative of the server type that it should be used for:
XCP
XSCF
Uboot
Kernel
Linux open source pkgs
SCF applications
OBP/POST
(combined image)
To display the XCP version that is registered in the XSCF, execute the
following command:
XSCF> version -c xcp -t
XCP: 1080
To display the details of the XCP version that is registered in the XSCF,
execute the following command:
XSCF> version -c xcp -v -t
XCP: 1080
OpenBoot PROM : 02.07.0000
XSCF : 01.08.0003
The following sample imports the firmware from an MSP (10.6.15.50) for a
high-end server that has enabled http access (the first attempt fails
because the wrong firmware is specified):
XSCF> getflashimage -v http://10.6.15.50/firmware/FFXCP1081.tar.gz
Error: invalid file name: FFXCP1081.tar.gz
XSCF> getflashimage -v http://10.6.15.50/firmware/DCXCP1081.tar.gz
Existing versions:
Version Size Date
DCXCP1080.tar.gz 51299678 Wed Jan 07 15:35:48 EST 2009
MD5: 314f0258d48c1d323034073b9270c44c
Warning: About to delete existing versions.
Continue? [y|n]: y
Removing DCXCP1080.tar.gz.
Free space: 97MB
0MB received
1MB received
2MB received
.
47MB received
The following sample imports the firmware from an MSP (10.6.15.50) for a
mid-range server that has enabled anonymous ftp access:
XSCF> getflashimage -v ftp://10.6.15.50/pub/FFXCP1081.tar.gz
Existing versions:
Version Size Date
FFXCP1080.tar.gz 51299678 Wed Jan 07 15:35:48 EST 2009
MD5: 314f0258d48c1d323034073b9270c44c
Warning: About to delete existing versions.
Continue? [y|n]: y
Removing FFXCP1080.tar.gz.
Free space: 97MB
0MB received
1MB received
2MB received
.
47MB received
The following sample imports the mid-range server firmware from the
images directory on a USB drive:
Note You can update to 1081, 1080, 1072, 1071, 1070, 1061, 1060 firmware
from 1050 firmware. If your service processor is currently running
firmware older then 1050, you must first upgrade to the 1050 firmware
before upgrading to a higher level firmware.
To synchronize the firmware across both the Active and Standby service
processors, execute the following command:
XSCF> flashupdate -c sync
Checking the XCP image file, please wait a minute
The XCP file send to stand-by XSCF, please wait a minute
The stand-by XSCF update starting, it takes about 20 minutes,
for more detail, use showmonitorlog command.
Note The sync process would be used when a failed service processor
in a high-end server has been replaced with chassis power turned off.
Preparation
Your instructor will assign a server to each group. To complete this
exercise, you will need your servers hostname and the network
configuration. Use the Solaris cat command to read the /etc/hosts file
to determine your network settings.
Resource
NTS
IP Address
NTS Port #
Initial User Login installer
Initial User installer
Password
XSCF0-lan0
IP Address
XSCF0-lan0
netmask
XSCF0-lan1
IP Address
XSCF0-lan1
netmask
Service Processor
host name
Default Route None
IP Address
Domain 0
IP Address
Domain 0
netmask
Domain 1
IP Address
Domain 1
netmask
__________________________________________________
8. Verify the current user settings. Record the command used:
__________________________________________________
Exercise: Configuring the Service Processor
__________________________________________________
5. Test the External Network connection to your service processor,
using telnet:
a. Was the attempt successful? ______________________
b. If not, what is the problem? ______________________
Exercise Summary
Exercise Solutions
This section contains solutions to the exercise.
10. Verify the current version of firmware that has been imported to the
service processor:
Which command did you use? getflashimage -l
11. Perform the firmware update.
Which command did you use?
flashupdate -c update -m xcp -s 1081
12. After the service processor reboots, confirm that the firmware
version has been updated.
Which command did you use? version -c xcp
Module 4
Objectives
Upon completion of this module, you should be able to:
Manage user accounts
Configure the service processor to make use of the Lightweight
Directory Access Protocol (LDAP)
Configure the Domain Name System (DNS)
Configure the Simple Network Management Protocol (SNMP)
View the platform environment
View hardware configurations
Manage Capacity on Demand (COD)
Use the XSCF Web
Relevance
Additional Resources
Command Result
The most common options and parameters for this command are:
The following example sets the minimum days and number of passwords
to remember:
XSCF> setpasswordpolicy -n 12 -r 5
The following example sets the minimum password length and maximum
number of days that passwords are valid.
XSCF> setpasswordpolicy -m 8 -M 30
Note System accounts with UIDs less than 100 (such as root, bin, and
daemon), and UID 65534 and 65535 cannot be disabled.
The following example sets the session timeout of the XSCF shell to
30 minutes.
XSCF> setautologout -s 30
The following example displays the session timeout time of the XSCF
shell:
XSCF> showautologout
30 min
Note The LDAP server must support Privileges and Users in the
LDAP schema. Reference the Sun SPARC Enterprise
M4000/M5000/M8000/M9000 Servers Administration Guide for more
information.
Note Do not confuse this with authentication and privilege lookups for
the servers domains, which are provided by the Solaris OS.
Command Result
Command Result
setldap Configures the service processor as an
LDAP client
showldap Displays the LDAP configuration for
the service processor
Caution If you use the LDAP service, lookups are first performed
locally, and then through the LDAP server. If no privileges are specified
for a user, the setprivileges command deletes any local privilege data
for that user. Subsequently, if LDAP privilege is enabled, the users
privileges are looked up in LDAP. If the none privilege is specified for a
user, that user does not have any privileges, regardless of privilege data
in LDAP.
The following section provides examples using both the setldap and
showldap commands for simplicity.
Command Description
Authenti
Via SMTP sever
cation
Internet
XSCF
Parts fault
Unauthorized
Mail terminal System access
The setsmtp command configures an e-mail server for use on the XSCF.
The showsmtp command displays the current SMTP settings.
The setsmtp command can also be used without options. This will
invoke an interactive mode that asks you to provide:
Name of the mail server to use
Port number to use (default is 25)
Authentication mode (none, POP, or smtp-auth)
Valid e-mail address as the reply-to address
Command Result
Note After the service processor is configured to use the DNS service, it
does not require day-to-day management. However, if you replace a
service processor board, you must reconfigure the new service processor
to use the DNS service.
The following example adds two hosts with the IP addresses 10.18.108.10
and 10.24.1.2 as DNS servers.
XSCF> setnameserver -c add 10.18.108.10 10.24.1.2
The following example deletes the host with the IP address 10.18.108.10.
XSCF> setnameserver -c del 10.18.108.10
Configuring SNMP
Simple Network Management Protocol (SNMP) is an application-layer
protocol that facilitates the exchange of management information between
network devices. It is a standard part of the TCP/IP protocol suite.
There is also a function called TRAP which is used by the SNMP agent to
exchange special information in asynchronous communication with the
SNMP manager. The SNMP agent uses the 161 port and the 162 port for
traps by default (see Figure 4-2).
SNMP Traps
When an event occurs, the SNMP agent function notifies the SNMP
manager of the event. This function is called a TRAP. The XSCF TRAP
covers the following events (see Figure 4-3):
XSCF SNMP agent startup
Occurrences of unauthorized access to the XSCF SNMP agent
Hardware fault occurrences in the system
Replacement of a faulty component in the system and system
recovery
Notification of another XSCF event
System
XSCF
Trap issued
XSCF SNMP agent function started
System
XSCF
System
SNMP manager
XSCF
Parts fault
System
XSCF
Faulty part replaced
System
XSCF
Event reports.
The service processor can export the following information to any SNMP
manager:
System information, such as chassis ID, platform type, total number
of CPUs, and total memory
Configuration of the MX000 server hardware
Dynamic reconfiguration information, including which
domain-configurable units are assigned to which domains
Domain status
Power status
Environmental status
Setup Information
The SNMP agent on the service processor is disabled by default and must
be configured and enabled by an administrator. The SNMP agent that is
bundled in the XSCF software includes the standard version 3 (v3)
protocols for:
User management
User Security Model (USM)
View Access Control Model (VACM)
Command Result
SNMP V1/V2c:
Status: Enabled
Community String: public
The following example displays the setup information and the agent
status.
XSCF> showsunmc
Agent Status: Enabled, running
Setup Status: Set up
SunMC Server: 10.6.15.50
Security Seed: maplesyr
SNMPv1 Community String: public
Agent Port: 1161
Host Trap Port: 162
Host Event Port: 163
SNMP Agent Port: 161
Domain Agent Port: 1161
Command Result
Note If a value greater than 0m and less then 100m is specified, the
setting will still default to 100m and all values are rounded up to the
nearest 100m.
Note The specified altitude becomes valid when the service processor is
rebooted. The showaltitude man page has yet to be updated to state
this correctly. Refer to the current XCP release notes for more information.
Setting the altitude impacts the fan speed levels and ultimately the
temperature of the MX000 servers as shown in Table 4-7, Table 4-8, and
Table 4-9.
Table 4-7 Fan speed levels corresponding to altitude and environmental
temperature (Entry-level system)
Low speed 20C or less 18C or less 16C or less 14C or less
(level 1)
Low speed 19-22C 17-20C 15-18C 13-16C
(level 2)
Low speed 21-24C 19-22C 17-20C 15-18C
(level 3)
Low speed 23-26C 21-24C 19-22C 17-20C
(level 4)
Middle 25-28C 23-26C 21-24C 19-22C
speed
(level 5)
Middle 27-30C 25-28C 23-26C 21-24C
speed
(level 6)
High speed 29-32C 27-30C 25-28C 23-26C
(level 7)
High speed 31-34C 29-32C 27-30C 25-28C
(level 8)
High speed More than More than More than More than
(level 9) 33C 31C 29C 27C
Low speed 25C or less 23C or less 21C or less 19C or less
Low speed 27C or less 25C or less 23C or less 21C or less
High speed More than More than More than More than
24C 22C 20C 18C
Wait Time
Warmup Time
The warm-up time is intended to prevent the PSU and the FAN from
running until the power supply environments of peripheral units are
prepared after the server starts the power-on processing. Once the warm-
up time is set, the OpenBoot PROM will start after the server power
supply is turned on, the power-on processing starts, and the set warm-up
time elapses.
If the system power has already been turned on and the system is
operating, the setting takes effect at the next startup.
The following example displays the current settings for the wait time and
warmup time:
XSCF> showpowerupdelay
warmup time : 10 minute(s)
wait time : 20 minute(s)
If a power recovery is reported by the UPS within the specified time, then
a shutdown will not occur.
The following example displays the current setting for the shutdown
delay in seconds:
XSCF> showshutdowndelay
UPS shutdown wait time : 300 second(s)
Command Result
Status States
The status for each FRU may be any of the following states:
Normal The unit is functioning normally.
Maintenance The unit is in a maintenance state.
Faulted The unit has been degraded due to a failure.
Degraded The unit has been partially degraded.
Deconfigured A unit has been degraded due to an error, or a unit
that was normal has now been degraded because another unit was
degraded.
MEM#02B Status:Normal;
+ Code:c1000000000000005372T128000HR3.7A 352f-0d054027;
+ Type:1A; Size:1 GB;
MEM#03A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d386;
+ Type:2B; Size:2 GB;
MEM#03B Status:Normal;
+ Code:c1000000000000005372T128000HR3.7A 352f-0d054820;
+ Type:1A; Size:1 GB;
MEM#10A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d38c;
+ Type:2B; Size:2 GB;
MEM#10B Status:Normal;
+ Code:ce0000000000000001M3 93T2950CZ3-CD5 3343-46009f19;
+ Type:1A; Size:1 GB;
MEM#11A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d409db0f;
+ Type:2B; Size:2 GB;
MEM#11B Status:Normal;
+ Code:ce0000000000000001M3 93T2950CZ3-CD5 3343-46009f25;
+ Type:1A; Size:1 GB;
MEM#12A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d409dafb;
+ Type:2B; Size:2 GB;
MEM#12B Status:Normal;
+ Code:ce0000000000000001M3 93T2950CZ3-CD5 3343-46009ec8;
+ Type:1A; Size:1 GB;
MEM#13A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d3e0;
+ Type:2B; Size:2 GB;
MEM#13B Status:Normal;
+ Code:ce0000000000000001M3 93T2950CZ3-CD5 3343-46009ec4;
+ Type:1A; Size:1 GB;
MEM#20A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d3e4;
+ Type:2B; Size:2 GB;
MEM#20B Status:Normal;
+ Code:ce0000000000000001M3 93T2950CZ3-CD5 3343-46009e62;
+ Type:1A; Size:1 GB;
MEM#21A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d409daf0;
+ Type:2B; Size:2 GB;
MEM#21B Status:Normal;
+ Code:ce0000000000000001M3 93T2950CZ3-CD5 3343-46009e35;
+ Type:1A; Size:1 GB;
MEM#22A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d409da83;
+ Type:2B; Size:2 GB;
MEM#22B Status:Normal;
+ Code:c1000000000000005372T128000HR3.7A 352f-0d054b25;
+ Type:1A; Size:1 GB;
MEM#23A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d409daea;
+ Type:2B; Size:2 GB;
MEM#23B Status:Normal;
+ Code:c1000000000000005372T128000HR3.7A 352f-0d054327;
+ Type:1A; Size:1 GB;
MEM#30A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d409db09;
+ Type:2B; Size:2 GB;
MEM#30B Status:Normal;
+ Code:c1000000000000005372T128000HR3.7A 352f-0d053f22;
+ Type:1A; Size:1 GB;
MEM#31A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d38a;
+ Type:2B; Size:2 GB;
MEM#31B Status:Normal;
+ Code:ce0000000000000001M3 93T2950CZ3-CD5 3343-46009f18;
+ Type:1A; Size:1 GB;
MEM#32A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d409da98;
+ Type:2B; Size:2 GB;
MEM#32B Status:Normal;
+ Code:c1000000000000005372T128000HR3.7A 352f-0d054423;
+ Type:1A; Size:1 GB;
MEM#33A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d409daed;
+ Type:2B; Size:2 GB;
MEM#33B Status:Normal;
+ Code:c1000000000000005372T128000HR3.7A 352f-0d054921;
+ Type:1A; Size:1 GB;
CMU#1 Status:Normal; Ver:0101h; Serial:PP0640T679 ;
+ FRU-Part-Number:CA06620-D001 A8 ;
+ Memory_Size:64 GB;
CPUM#0-CHIP#0 Status:Normal; Ver:0201h; Serial:PP0637B123 ;
+ FRU-Part-Number:CA06620-D021 A6 ;
+ Freq:2.280 GHz; Type:16;
+ Core:2; Strand:2;
CPUM#1-CHIP#0 Status:Normal; Ver:0201h; Serial:PP0646C268 ;
+ FRU-Part-Number:CA06620-D021 A6 ;
+ Freq:2.280 GHz; Type:16;
+ Core:2; Strand:2;
MEM#32A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d3aa;
+ Type:2B; Size:2 GB;
MEM#32B Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d393;
+ Type:2B; Size:2 GB;
MEM#33A Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d392;
+ Type:2B; Size:2 GB;
MEM#33B Status:Normal;
+ Code:2cffffffffffffff0836HTF25672Y-53EB1 0100-d504d38b;
+ Type:2B; Size:2 GB;
IOU#0 Status:Normal; Ver:0101h; Serial:PP06433532 ;
+ FRU-Part-Number:CA06629-D027 A5 ;
PCI#0 Name_Property:pci; Card_Type:IOUA;
IOU#1 Status:Normal; Ver:0101h; Serial:PP06420636 ;
+ FRU-Part-Number:CA06629-D027 A5 ;
XSCFU_B#0 Status:Normal,Standby; Ver:0201h; Serial:PP0711002C ;
+ FRU-Part-Number:CA06620-D342 B0 /371-2228-02 ;
XSCFU_B#1 Status:Normal,Active; Ver:0201h; Serial:PP0638J133 ;
+ FRU-Part-Number:CA06620-D341 A7 ;
BP_A#0 Status:Normal; Ver:0101h; Serial:PP0640S354 ;
+ FRU-Part-Number:CA20006-B37X 013AL ;
DDC_A#0 Status:Normal; Ver:05D; Serial:MD06440124;
+ FRU-Part-Number:CA01022-0680 05D /371-2227-04 ;
DDC_A#1 Status:Normal; Ver:05D; Serial:MD06440096;
+ FRU-Part-Number:CA01022-0680 05D /371-2227-04 ;
OPNL#0 Status:Normal; Ver:0101h; Serial:PP0640S619 ;
+ FRU-Part-Number:CA06620-D381 A1 ;
PSU#0 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#1 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#2 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#3 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#4 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#5 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#6 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#7 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#8 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:On;
PSU#40 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
PSU#41 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
PSU#42 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
PSU#43 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
PSU#44 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
PSU#45 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
PSU#46 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
PSU#47 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
PSU#48 Status:Normal; Serial:;
+ FRU-Part-Number:CA01022-0690;
+ Power_Status:Breaker off;
FANBP_C#0 Status:Normal; Ver:0301h; Serial:PP0637A712 ;
+ FRU-Part-Number:CA21128-B73X 004AC ;
FAN_A#0 Status:Normal; Serial:PP0637D547;
+ FRU-Part-Number:CA06622-D011 A0;
FAN_A#1 Status:Normal; Serial:PP0637D545;
+ FRU-Part-Number:CA06622-D011 A0;
FAN_A#2 Status:Normal; Serial:PP0637D527;
+ FRU-Part-Number:CA06622-D011 A0;
FAN_A#3 Status:Normal; Serial:PP0637D526;
+ FRU-Part-Number:CA06622-D011 A0;
FAN_B#0 Status:Normal; Serial:PP0637C351;
+ FRU-Part-Number:CA06621-D011 A0;
FAN_B#1 Status:Normal; Serial:PP0637C359;
+ FRU-Part-Number:CA06621-D011 A0;
FANBP_C#1 Status:Normal; Ver:0301h; Serial:PP0637A732 ;
+ FRU-Part-Number:CA21128-B73X 004AC ;
FAN_B#2 Status:Normal; Serial:PP0637C346;
+ FRU-Part-Number:CA06621-D011 A0;
FAN_B#3 Status:Normal; Serial:PP0637C342;
+ FRU-Part-Number:CA06621-D011 A0;
FAN_B#4 Status:Normal; Serial:PP0637C345;
+ FRU-Part-Number:CA06621-D011 A0;
FAN_B#5 Status:Normal; Serial:PP0637C352;
+ FRU-Part-Number:CA06621-D011 A0;
FAN_B#6 Status:Normal; Serial:PP0637C348;
+ FRU-Part-Number:CA06621-D011 A0;
FAN_B#7 Status:Normal; Serial:PP0637C353;
+ FRU-Part-Number:CA06621-D011 A0;
SWBP#0 Status:Normal; Ver:0101h; Serial:PP06369451 ;
+ FRU-Part-Number:CA20396-B67X 006AE ;
MEDBP#0 Status:Normal; Ver:0101h; Serial:PP0638F685 ;
+ FRU-Part-Number:CA20396-B66X 005AC ;
The following example displays a CPU memory unit and memory module
on a motherboard that are degraded because of an error:
XSCF> showstatus
* CMU#0; Status:Deconfigured;
* XBU_B#0; Status:Faulted;
Select [1-5|c:cancel] :1
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select whether to replace a CMU only, an IOU only,
or both a CMU and an IOU.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a CMU to be replaced.
DomainID
No. FRU XSB#0 XSB#1 XSB#2 XSB#3 Power Status
--- ------------- ----------------------- ----- ---------------
1. CMU#0 SP SP SP SP Off Normal
2. CMU#1 SP SP SP SP Off Normal
3. CMU#2 -- -- -- -- --- Not installed
4. CMU#3 -- -- -- -- --- Not installed
---------------------------------------------------------------
Select [1-4|b:back] :b
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select whether to replace a CMU only, an IOU only,
or both a CMU and an IOU.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select an IOU to be replaced.
DomainID
No. FRU XSB#0 XSB#1 XSB#2 XSB#3 Power Status
--- ------------- ----------------------- ----- ---------------
1. IOU#0 SP SP SP SP Off Normal
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select whether to replace a CMU only, an IOU only,
or both a CMU and an IOU.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a CMU/IOU to be replaced.
DomainID
No. FRU XSB#0 XSB#1 XSB#2 XSB#3 Power Status
--- ------------- ----------------------- ----- ---------------
1. CMU#0/IOU#0 SP SP SP SP Off Normal
2. CMU#1/IOU#1 SP SP SP SP Off Normal
3. CMU#2/IOU#2 -- -- -- -- --- Not installed
4. CMU#3/IOU#3 -- -- -- -- --- Not installed
---------------------------------------------------------------
Select [1-4|b:back] :b
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select whether to replace a CMU only, an IOU only,
or both a CMU and an IOU.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a type of FRU to be replaced.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a FAN to be replaced.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a type of FRU to be replaced.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a PSU to be replaced.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a type of FRU to be replaced.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select an XSCF to be replaced.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a type of FRU to be replaced.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a DDC_A to be replaced.
---------------------------------------------------------------
Maintenance/Replacement Menu
Please select a type of FRU to be replaced.
Where:
When a container is found, the data from that container is printed in a tree
structure as well. Without any arguments, the command prints the FRU
hierarchy and all of the FRUID container data.
The following example prints the FRU tree and lists the available
containers:
XSCF> prtfru -l
/frutree
/frutree/chassis (fru)
/frutree/chassis/CMU#0 (fru)
/frutree/chassis/CMU#0/CMU (container)
/frutree/chassis/CMU#0/CMU/CPUM#0/CPUM (container)
/frutree/chassis/CMU#0/CMU/CPUM#1/CPUM (container)
/frutree/chassis/CMU#0/CMU/CPUM#2/CPUM (container)
/frutree/chassis/CMU#0/CMU/CPUM#3/CPUM (container)
/frutree/chassis/CMU#0/CMU/MEM#0/MEM (container)
/frutree/chassis/CMU#0/CMU/MEM#1/MEM (container)
/frutree/chassis/CMU#0/CMU/MEM#2/MEM (container)
... <output omitted>
/frutree/chassis/CMU#0/CMU/MEM#30/MEM (container)
/frutree/chassis/CMU#0/CMU/MEM#31/MEM (container)
/frutree/chassis/CMU#1 (fru)
/frutree/chassis/CMU#1/CMU (container)
/frutree/chassis/CMU#1/CMU/CPUM#4/CPUM (container)
/frutree/chassis/CMU#1/CMU/CPUM#5/CPUM (container)
/frutree/chassis/CMU#1/CMU/CPUM#6/CPUM (container)
/frutree/chassis/CMU#1/CMU/CPUM#7/CPUM (container)
/frutree/chassis/CMU#1/CMU/MEM#32/MEM (container)
/frutree/chassis/CMU#1/CMU/MEM#33/MEM (container)
/frutree/chassis/CMU#1/CMU/MEM#34/MEM (container)
... <output omitted>
/frutree/chassis/CMU#1/CMU/MEM#62/MEM (container)
/frutree/chassis/CMU#1/CMU/MEM#63/MEM (container)
/frutree/chassis/IOU#0 (fru)
/frutree/chassis/IOU#0/IOU (container)
/frutree/chassis/IOU#0/IOU/IOUA#0/IOUA (container)
/frutree/chassis/IOU#0/IOU/IOUA#2/IOUA (container)
/frutree/chassis/IOU#1 (fru)
/frutree/chassis/IOU#1/IOU (container)
/frutree/chassis/PSU#0 (fru)
/frutree/chassis/PSU#0/PSU (container)
/frutree/chassis/PSU#1 (fru)
/frutree/chassis/PSU#1/PSU (container)
/frutree/chassis/PSU#2 (fru)
/frutree/chassis/PSU#2/PSU (container)
...<output omitted>
/frutree/chassis/PSU#7 (fru)
/frutree/chassis/PSU#7/PSU (container)
/frutree/chassis/PSU#8 (fru)
/frutree/chassis/PSU#8/PSU (container)
/frutree/chassis/FANBP_C#0 (fru)
/frutree/chassis/FANBP_C#0/FANBP_C (container)
/frutree/chassis/FANBP_C#1 (fru)
/frutree/chassis/FANBP_C#1/FANBP_C (container)
/frutree/chassis/FAN_B#0 (fru)
/frutree/chassis/FAN_B#0/FAN_B (container)
/frutree/chassis/FAN_B#1 (fru)
/frutree/chassis/FAN_B#1/FAN_B (container)
/frutree/chassis/FAN_A#0 (fru)
/frutree/chassis/FAN_A#0/FAN_A (container)
/frutree/chassis/FAN_A#1 (fru)
/frutree/chassis/FAN_A#1/FAN_A (container)
/frutree/chassis/FAN_A#2 (fru)
/frutree/chassis/FAN_A#2/FAN_A (container)
/frutree/chassis/FAN_A#3 (fru)
/frutree/chassis/FAN_A#3/FAN_A (container)
/frutree/chassis/FAN_B#2 (fru)
/frutree/chassis/FAN_B#2/FAN_B (container)
/frutree/chassis/FAN_B#3 (fru)
/frutree/chassis/FAN_B#3/FAN_B (container)
/frutree/chassis/FAN_B#4 (fru)
/frutree/chassis/FAN_B#4/FAN_B (container)
/frutree/chassis/FAN_B#5 (fru)
/frutree/chassis/FAN_B#5/FAN_B (container)
/frutree/chassis/FAN_B#6 (fru)
/frutree/chassis/FAN_B#6/FAN_B (container)
/frutree/chassis/FAN_B#7 (fru)
/frutree/chassis/FAN_B#7/FAN_B (container)
/frutree/chassis/BP_A (container)
/frutree/chassis/BP_A/DDC#0/DDC (container)
/frutree/chassis/BP_A/DDC#1/DDC (container)
/frutree/chassis/SWBP#0 (fru)
/frutree/chassis/SWBP0/SWBP (container)
/frutree/chassis/MEDBP#0 (fru)
/frutree/chassis/MEDBP#0/MEDBP (container)
/frutree/chassis/XSCFU_B#0 (fru)
/frutree/chassis/XSCFU_B#0/XSCFU_B (container)
/frutree/chassis/OPNL (container)
The following example displays FRUID data for the CMU container;
notice the use of the fully qualified container name:
XSCF> prtfru -c /frutree/chassis/OPNL
/frutree/chassis/OPNL (container)
AREA NAME:Board_Area
Language: 25
Manufacture_Data_Time: Sat May 25 16:02:30 1996
Manufacturer: FUJITSU LIMITED
Product_name: OPNL
Serial_number: PP0640S619
Part_number: CA06620-D381 A1
AREA NAME:Internal_Area
board_id: 257
fan_high_level0: 27
fan_low_level0: 24
start_over_temp_level0: 36
stop_over_temp_level0: 32
start_under_temp_level0: 1
stop_under_temp_level0: 5
fan_high_level1: 25
fan_low_level1: 22
start_over_temp_level1: 34
stop_over_temp_level1: 30
start_under_temp_level1: 1
stop_under_temp_level1: 5
fan_high_level2: 23
fan_low_level2: 20
start_over_temp_level2: 32
stop_over_temp_level2: 28
start_under_temp_level2: 1
stop_under_temp_level2: 5
fan_high_level3: 21
stop_under_temp_level3: 5
AREA NAME:TroubleInfo_Area
AREA NAME:System_Area
cod_board_info: 0x00
The following example switches the state of the current XSCF to Active:
XSCF> switchscf -t Active
The following example sets the system to use clock board 1 at the next
system startup and then verifies the setting:
XSCF> clockboard -s 1
XSCF> clockboard
current CLKU_B number :0
next CLKU_B number :1
COD Overview
COD Boards
A COD board is a system board that has been configured at the factory for
COD capability. COD boards come in the same configurations as standard
system boards. The number of CPUs per COD board depends on the
server configuration that is purchased.
The purchase of a COD right to use (RTU) license from the Sun License
Center entitles you to receive a license key, which enables the appropriate
number of COD processors.
A license key issued by the Sun License Center is comprised of text lines,
which can be e-mailed and then added to the COD license database. A
single license key can grant access to multiple RTUs, as specified when
the key is generated.
Headroom Management
When you intend to purchase a license but need to use the resources
immediately
By default, COD resources arrive with headroom disabled. You can use
the setcod command to enable headroom. However, if all your COD
resources are already licensed, configuring headroom has no effect. In that
case, you must install additional COD boards to retain your headroom
capacity. You can also reduce or disable headroom at any time.
Command Result
Note If you need to trade a COD license for example, the license is
corrupted or move licenses from one server to another, use the
deletecodlicense to remove the old license and use the addcodlicense
to install the new license.
Managing Headroom
Three procedures can be performed when managing headroom. These are:
Increasing headroom
Decreasing headroom
Disabling headroom
Disabling Headroom
01:84000000:104:0301010100:3:00000000:xxxxxxxxxxxxxxx
Each of the functions listed in the following table provides the same
results as those of the corresponding XSCF shell commands.
Login page XSCF Wed console login page. Log in with an XSCF
user account.
Menu frame This page is displayed by default after login. There
are three navigators available from the menu frame.
They are:
Menu The menu of various settings, operations,
and state displays that are displayed in the tree.
Physical The physical components of the server
are displayed in this tree.
Logical The logical components of each domain
are displayed in the tree.
Main page A generic name of the detailed page located at the
right of the screen. When you select an item from the
menu in the tree frame, the target page is displayed
here.
Event frame A page that displays the entire system events. The
content of the display is refreshed every 60 seconds.
You can change the interval on the same frame.
Status display This page displays the status of the entire system and
the domain status. The page display includes the
External I/O expansion unit status.
Note This exercise has been written with the assumption that you have
remote access to an MX000 server.
Preparation
Your instructor will assign a server to each group. To complete this
exercise, you will need your servers hostname and the network
configuration. Use the Solaris cat command to read the /etc/hosts file
to determine your network settings.
Resource
NTS
IP Address
NTS Port #
User Login user1
User Password cangetin
XSCF0-lan0
IP Address
XSCF0-lan0
netmask
XSCF0-lan1
IP Address
XSCF0-lan1
netmask
Service Processor
host name
9. From the serial connection, view the status of the telnet protocol.
Record the command used:
Command: _______________________________________________
Status: __________________________
10. From the serial connection, re-enable the telnet service, record the
command used, and verify that telnet is again working.
Command: _______________________________________________
Status: __________________________________________________
9. Display the user account information for the user account just
created. Record the command used:
Command: __________________________________________
10. Display the privileges assigned to this user account. Record the
command used:
Command: __________________________________________
11. Configure all user accounts to automatically log out of the XSCF
after two minutes. Record the command used:
Command: __________________________________________
12. Verify the logout setting from step 11. Did the value change? Record
the command used:
Command: __________________________________________
13. Log out and log back in using the same account.
14. Re-verify the previous logout setting Did the value change? Record
the command used:
Command: __________________________________________
15. Allow enough time to pass so that the users session is automatically
logged out.
16. Log in to the XSCFU using the user1 account.
17. Disable the new account you created earlier in this lab. Record the
command used:
Command: __________________________________________
18. Log out of the XSCF.
19. Attempt to log back in to the XSCFU using the account you just
disabled. Was your attempt successful? __________________
20. Log in to the XSCF using the user1 account and re-enable the
disabled account. Record the command used:
Command: _________________________________________
21. Log out of the XSCF and re-login as your new user.
22. Attempt to change the password for this user by only changing two
characters of the original password. Record the command used and
whether you were successful or not and why:
Command: _________________________________________
__________________________________________________
23. As your final step for this task, execute the steps needed to delete
this user from the system. Record your steps:
Command: _________________________________________
__________________________________________________
11. Display the current settings for the servers power-up wait time and
warmup time. Record the command used:
Command: _______________________________________________
12. Identify the current values for the following:
Warmup time: _____________________________
Wait time: ________________________________
13. Change the warmup and wait times to the following values:
Warmup time: 4 minutes
Wait time: 2 minutes
Command: _______________________________________________
14. When would these new values take effect? _______________________
15. Reset the warmup and wait times to their previous settings:
Command: _______________________________________________
16. Display the current settings for the servers shutdown delay time.
Record the command used:
Command: _______________________________________________
Status: _________________________
17. Change the shutdown delay to 30 seconds:
Command: _______________________________________________
18. When would this new value take effect? _______________________
19. Reset the shutdown delay to the previous setting:
Command: _______________________________________________
PSB#00
CPUM#0-CHIP#0
CPUM#0-CHIP#1
CPUM#1-CHIP#0
CPUM#1-CHIP#1
MEMB#0
MEMB#1
MEMB#2
MEMB#3
IOU#0
PSB#01
CPUM#2-CHIP#0
CPUM#2-CHIP#1
CPUM#3-CHIP#0
CPUM#3-CHIP#1
MEMB#4
MEMB#5
MEMB#6
MEMB#7
IOU#1
Status: _______________________________________________
7. Which command would be used to display FRUID data for the
server as well as list all available containers?
Command: _______________________________________________
8. Using this command, display the FRU tree for the operator panel of
your assigned server. Record the command used:
Command: _______________________________________________
9. Identify the values for the following:
Manufacturer: _____________________________
Serial Number: ____________________________
Part Number: _____________________________
10. Perform an initial diagnosis test of a physical system board (PSB)
that has been assigned for your use. Initially use the default
diagnostic level, but if time permits, execute a maximum diagnostic
level as well. Record the command used:
Command: _______________________________________________
Status: _________________________________
Exercise Summary
Exercise Solutions
This section provides solutions to the exercises.
22. Attempt to change the password for this user by only changing two
characters of the original password. Record the command used and
whether you were successful or not and why:
Command: password -- should not be successful due to the setting earlier in
setpasswordpolicy requiring a minimum of four characters to be changed.
23. As your final step for this task, execute the steps needed to delete
this user from the system. Record your steps:
Command: deleteuser joe -- if successful you are deleting your current account and
may just exit you out. To be safe, run these commands:
1. exit
2. Log in as another user with proper privileges, such as
user1.
3. deleteuser joe
Command: showpowerupdelay
12. Identify the current values for the following:
Warm-up time: varies
Wait time: varies
13. Change the warm-up and wait times to the following values:
Warm-up time: 4 minutes
Wait time: 2 minutes
Command: setpowerupdelay -c warmup -s 4
Command: setpowerupdelay -c wait -s 2
14. When would these new values take effect? After a server reboot
15. Reset the warm-up and wait times to their previous settings:
Command: setpowerupdelay -c warmup -s 0
Command: setpowerupdelay -c wait -s 0
16. Display the current settings for the servers shutdown delay time.
Record the command used:
Command: showshutdowndelay
Status: varies
17. Change the shutdown delay to 30 seconds:
Command: setshutdowndelay -s 30
18. When would this new value take effect? A power failure
19. Reset the shutdown delay to the previous setting:
Command: setshutdowndelay -s X
PSB#00
CPUM#0-CHIP#0
CPUM#0-CHIP#1
CPUM#1-CHIP#0
CPUM#1-CHIP#1
MEMB#0
MEMB#1
MEMB#2
MEMB#3
IOU#0
PSB#01
CPUM#2-CHIP#0
CPUM#2-CHIP#1
CPUM#3-CHIP#0
CPUM#3-CHIP#1
MEMB#4
MEMB#5
MEMB#6
MEMB#7
IOU#1
Command: prtfru -l
8. Using this command, display the FRU tree for the operator panel of
your assigned server. Record the command used:
Command: prtfru -c /frutree/chassis/OPNL
9. Identify the values for the following:
Manufacturer: Sun or Fujitsu
Serial Number: varies
Part Number: CA06620-D381, but may vary
10. Perform an initial diagnosis test of a physical system board (PSB)
that has been assigned for your use. Initially use the default
diagnostic level, but if time permits, execute a maximum diagnostic
level as well. Record the command used:
Command: testsb X
Command: testsb -m diag=max X
Status: varies
Module 5
Objectives
Upon completion of this module, you should be able to:
Document the characteristics of a domain
Chose between Uni-mode or Quad-mode for a Physical System
Board (PSB) set
Modify the Domain Component List (DCL)
Allocate Extended System Boards (XSBs) to a domain
Power on a domain
Document the structure of the OpenBoot PROM (OBP) device tree
for the MX000 servers
Access domain information using standard Solaris OS utilities
Relevance
Additional Resources
Exploring Domains
The MX000 mid-range and high-end servers have the ability to run
several instances of the Solaris OS. These separate and independent
instances are called domains.
The MX000 mid-range and high-end server domains have the following
characteristics:
Domains are based on allocated extended system boards (XSBs)
Domains do not depend on each other and are isolated
A panic in one domain does not affect the other domains running in
the server
Domain Implementation
There are two implementations of domain configuration:
Static (inactive)
Dynamic (active)
Domain Configuration
To configure a domain, you must assign system resources, or Extended
System Boards (XSBs), to each domain that you intend to use.
The service processor supports commands that allow you to group XSBs
into domains.
When the configuration has been set and the domain is initialized, the
initialization process isolates a domains resources. This is referred to as a
Dynamic System Domain (DSD).
You can create and delete domains without interrupting the
operation of other domains.
You can use domains for many purposes, such as:
Testing a new Solaris OS update
Setting up a development environment
Setting up a testing environment
You can also configure several domains to support different
departments, with one domain per department
You can also temporarily reconfigure the system into one domain to
run a large job
The DCUs in the MX000 servers consist of XSBs. Before configuring the
XSBs, you should be familiar with the following terms:
Physical System Board (PSB) set Consists of CPU, memory, and I/O
Specific boards are associated with specific with each PSB set
eXtended System Board (XSB)
Is the result of using your PSB in uni-mode (XX-0) with all of
the associated CPUs, memory boards, and IO being assigned to
the XSB
Is the result of using your PSB in quad-mode (XX-0, XX-1, XX-2,
XX-3) with the associated CPUs, memory boards, and IO being
divided among as many as four extents
The numbering scheme is based on the PSB number coming
first, followed by a dash and then the extent number
Logical System Board (LSB) number To create domains, you must
first assign an LSB number to an XSB
There are 16 LSB numbers for every domain
The LSB numbers range from 0 to 15
Domain Configuration
You can create a domain out of any group of XSBs if the following
conditions are met:
The domain must contain at least one XSB with CPU, Memory, and
I/O.
The domain should have at least one network interface.
The domain must have sufficient memory to support the OS and
applications.
At least one boot disk should be connected to the domains I/O. If a
domain does not have its own disk, at least one network interface
must exist so that you can boot the domain from the network.
Note The domain should contain enough physical resources to meet the
requirements of the planned workload. Prior to constructing the domain,
a resource needs assessment should be performed to ensure that the assigned
resources meet these requirements.
The showdcl command displays the current DCL. You can get
information about all of the domains by using the -a option and for a
specific domain by using the -d with the domain_id option.
In the following example, the XSB 00-0 is assigned to LSB 0 and it is part
of domain 0, which is currently powered off.
XSCF> showdcl -a
DID LSB XSB System
00 Powered Off
00 00-0
In the following output of showboards, there are five XSBs. XSB 00-0 is
assigned to domain 0. The remaining four XSBs or Domain Configurable
Units (DCUs) are in the system board pool (SP).
XSCF> showboards -a
XSB DID(LSB) Assignment Pwr Conn Conf Test Fault
---- -------- ----------- ---- ---- ---- ------- --------
00-0 00(00) Assigned n n n Unknown Normal
00-1 SP Unavailable n n n Unknown Normal
00-2 SP Unavailable n n n Unknown Normal
00-3 SP Unavailable n n n Unknown Normal
01-0 SP Unavailable n n n Unknown Normal
In the following example, you can see the domain modes that are set for
domain 0.
XSCF> showdomainmode -d 0
Host-ID :80920ed0
Diagnostic Level :min
Secure Mode :off
Autoboot :off
CPU Mode :auto
The only option or parameter for this command is -a, which turns on
power to every domain that has been completely set up.
*Note*
This command only issues the instruction to power-on.
The result of the instruction can be checked by the "showlogs power".
The only option or parameter for this command is -a, which turns off
power to every domain that is running.
In the following example, we are connected into the serial console and
power is turned off to domain 0.
XSCF> poweroff -d 0
DomainIDs to be powered off:0
Continue? [y|n] :y
Note The power-off command at the OBP can also be used to power off
the domain.
To exit the domain console and return to the XSCF shell, press the Enter
key, and then type a #..
{0} ok
In the following example, the console path is shown for all domains that
can be accessed.
XSCF> showconsolepath -a
User DID ro/rw escape Date
installer 0 rw # Wed Feb 25 21:46:04 2009
I/O devices:
Device Instance name of I/O device
Resource Managed resource name
Usage Description of the instance using resources
Query Results of an off-line inquiry about resources
CPU:
----
DID XSB id state speed ecache
00 00-0 0 on-line 2280 5
00 00-0 1 on-line 2280 5
00 00-0 2 on-line 2280 5
00 00-0 3 on-line 2280 5
Memory:
-------
board perm base domain target deleted remaining
DID XSB mem MB mem MB address mem MB XSB mem MB mem MB
00 00-0 12288 1438 0x000003c000000000 12288
IO Devices:
----------
DID XSB device resource usage
00 00-0 sd0 /dev/dsk/c0t0d0s0 mounted filesystem "/"
00 00-0 sd0 /dev/dsk/c0t0d0s1 swap area
00 00-0 sd0 /dev/dsk/c0t0d0s1 dump device (swap)
00 00-0 bge0 SUNW_network/bge0 bge0 hosts IP addresses: 10.6.15.2
A user with platform administrator privileges can run this command for
all domains. Users with domain administrator privileges can run this for
their domain only.
sendbreak -d domain_id
*Note*
This command only issues the instruction to reset.
The result of the instruction can be checked by the "showlogs power".
{3} ok
After the domain completes POST, the available healthy devices are
identified to the domain. The OpenBoot PROM represents the
interconnected buses and their attached devices as a tree of nodes. This
tree is called the device tree. A node representing the host computers
main physical address bus forms the trees root node.
Nodes without children are called leaf nodes and generally represent
devices. However, some nodes represent system-supplied firmware
services or other objects.
{0} ok show-nets
a) /pci@4,600000/pci@0,1/network@1,1
b) /pci@4,600000/pci@0,1/network@1
c) /pci@0,600000/pci@0,1/network@1,1
d) /pci@0,600000/pci@0,1/network@1
q) NO SELECTION
Enter Selection, q to quit: q
The obp device tree listings for the Processors are bolded below.
The Processor numbers are 400, 408, 410, and 418.
There are two cores, each core having two threads.
The cores are numbered 0 and 1.
The two threads are shown as cpu@0 and cpu@1.
These are dual-core, dual-threaded processors (SPARC 64 VI).
/cmp@418,0/core@1
/cmp@418,0/core@0
/cmp@418,0/core@1/cpu@1
/cmp@418,0/core@1/cpu@0
/cmp@418,0/core@0/cpu@1
/cmp@418,0/core@0/cpu@0
/cmp@410,0/core@1
/cmp@410,0/core@0
/cmp@410,0/core@1/cpu@1
/cmp@410,0/core@1/cpu@0
/cmp@410,0/core@0/cpu@1
/cmp@410,0/core@0/cpu@0
/cmp@408,0/core@1
/cmp@408,0/core@0
/cmp@408,0/core@1/cpu@1
/cmp@408,0/core@1/cpu@0
/cmp@408,0/core@0/cpu@1
/cmp@408,0/core@0/cpu@0
/cmp@400,0/core@1
/cmp@400,0/core@0
/cmp@400,0/core@1/cpu@1
/cmp@400,0/core@1/cpu@0
/cmp@400,0/core@0/cpu@1
/cmp@400,0/core@0/cpu@0
/openprom/client-services
/packages/obp-tftp
/packages/terminal-emulator
/packages/disk-label
/packages/deblocker
/packages/SUNW,builtin-drivers
/packages/SUNW,probe-error-handler
The Solaris prtdiag command shows the LSB number for the allocated
XSB. The four processors and their Agent Identifiers (AIDs) are show as
well. The speed of the processors is 2.15 GHz and L2 cache is 5 MB.
# prtdiag -v
# prtdiag -v
System Configuration: Sun Microsystems sun4u Sun SPARC Enter M8000 Server
System clock frequency: 960 MHz
Memory size: 32768 Megabytes
Processor Numbers
Processor Numbers
LSB (Decimal in the Solaris
(HEX in OBP)
OS)
Processor Numbers
Processor Numbers
LSB (Decimal in the Solaris
(HEX in OBP)
OS)
8 500 (CPU CHIP # 0) 256-263
508 (CPU CHIP # 1) 264-271
510 (CPU CHIP # 2) 272-279
518 (CPU CHIP # 3) 280-287
9 520 (CPU CHIP # 0) 288-295
528 (CPU CHIP # 1) 296-303
530 (CPU CHIP # 2) 304-311
538 (CPU CHIP # 3) 312-319
10 540 (CPU CHIP # 0) 320-327
548 (CPU CHIP # 1) 328-335
550 (CPU CHIP # 2) 336-343
558 (CPU CHIP # 3) 344-351
11 560 (CPU CHIP # 0) 352-359
568 (CPU CHIP # 1) 360-367
570 (CPU CHIP # 2) 368-375
578 (CPU CHIP # 3) 376-383
12 580 (CPU CHIP # 0) 384-391
588 (CPU CHIP # 1) 392-399
590 (CPU CHIP # 2) 400-407
598 (CPU CHIP # 3) 408-415
13 5a0 (CPU CHIP # 0) 416-423
5a8 (CPU CHIP # 1) 424-431
5b0 (CPU CHIP # 2) 432-439
5b8 (CPU CHIP # 3) 440-447
14 5c0 (CPU CHIP # 0) 448-455
5c8 (CPU CHIP # 1) 456-463
5d0 (CPU CHIP # 2) 464-472
5d8 (CPU CHIP # 3) 473-479
15 5e0 (CPU CHIP # 0) 480-487
5e8 (CPU CHIP # 1) 488-495
5f0 (CPU CHIP # 2) 496-503
5f8 (CPU CHIP # 3) 504-511
Now, start with the two I/O controller (Oberons in Figure 5-1) and break
down their structure.
That identifies the first part of the address, /pci@0,600000. The second
part of the address is resolved from the PCI-E to OCI-X bridge chip on the
IOUA card, /pci@0,600000/pci@0 (as shown in Figure 5-2).
These are the internal disks shown in the format command output, so
they are connected to the LSI1064 SAS controller. The path is translated up
through /pci@0,600000/pci@0. There is one SCSI path coming off of
pci@0, scsi@1 and the last piece of information in the string shows you
that they are sd@0,0 and sd@1,0, which are SCSI disks at 0 and 1,
respectively, and both at LUN0. Two disks are accessible from controller 0
and two disks are accessible from controller 1. The full device alias that
you would see both at the OBP and in the Solaris OS is:
/pci@0,600000/pci@0/scsi@1/sd@1,0.
The slots on the physical IOU are numbered 03 on the bottom from left
to right and 47 on the top from left to right, as shown in Figure 5-3.
As with the CPU IDs, the AIDs are going to change for I/O based on your
LSB assignments. In the following example, the device alias is:
/devices/pci@x,y
where x is the portid and y is 600000 for leaf A and 700000 for leaf B.
The portid (x) changes for LSBs 0 through 15 are shown in Table 5-2.
0 /pci@0,600000/
1 /pci@10,600000/
2 /pci@20,600000/
3 /pci@30,600000/
4 /pci@40,600000/
5 /pci@50,600000/
6 /pci@60,600000/
7 /pci@70,600000/
8 /pci@80,600000/
9 /pci@90,600000/
10 /pci@a0,600000/
11 /pci@b0,600000/
12 /pci@c0,600000/
13 /pci@d0,600000/
14 /pci@e0,600000/
15 /pci@f0,600000/
The OBP device pathing structure for the I/O Expansion Unit PCI-E I/O
Boat is shown in Figure 5-5.
Figure 5-5 I/O Expansion Unit PCI-E I/O Boat Device Paths
The I/O Expansion Unit PCI-X I/O Boat which is shown in Figure 5-6 is
also supported.
The OBP device path structure for the I/O Expansion Unit PCI-X I/O
Boat can be seen in Figure 5-7.
Figure 5-7 I/O Expansion Unit PCI-X I/O Boat Device Paths
The Solaris OS
Remember the following about the Solaris OS running on your server:
Each domain has its own, independent copy of Solaris.
The minimum level of the Solaris OS required for a domain is the
Solaris 10 OS, update 4 (08/07). The best performance comes with
Solaris 10 update 7 (05/09).
Remember to install the recommended patches.
All applications that are not platform-dependent should run in a
domain without requiring change.
Because a domain does not have a keyboard, monitor, or serial port,
it is controlled through a console window from the service processor.
The Solaris OS detects the service processor as interface ttya.
Use the svcs command to verify the listed services are online. The
svcadm command is used to enable or disable services. Enable the dscp
service first. Check your network configuration using ifconfig. If the
sppp0 interface does not exist, that is an indication that the dscp service is
NOT enabled.
godzilla-dom0 # ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu
8232 index 1
inet 127.0.0.1 netmask ff000000
bge0: flags=1000803<UP,BROADCAST,MULTICAST,IPv4> mtu 1500 index 2
inet 10.6.15.30 netmask ff000000 broadcast 10.255.255.255
ether 0:b:5d:e0:5:1a
godzilla-dom0 # ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu
8232 index 1
inet 127.0.0.1 netmask ff000000
bge0: flags=1000803<UP,BROADCAST,MULTICAST,IPv4> mtu 1500 index 2
inet 10.6.15.30 netmask ff000000 broadcast 10.255.255.255
ether 0:b:5d:e0:5:1a
sppp0:
flags=10010008d1<UP,POINTOPOINT,RUNNING,NOARP,MULTICAST,IPv4,FIXEDMTU>
mtu 1500 index 4
inet 192.168.224.2 --> 192.168.224.1 netmask ffffff00
ether 0
Note This exercise has been written with the assumption that the
student will have remote access to an MX000 server.
Preparation
To prepare for this exercise:
Your instructor will assign a server to each group. Your assigned server is:
_________________________________________________________________
13. At the obp, determine how many network interfaces your domain
has. Record the command used:
Command:___________________________________________________
14. At the obp, determine how many processors your domain has.
Record the command used:
Command:___________________________________________________
15. How many processors you have for domain 0? Record the command
used:
Command:___________________________________________________
16. Boot to Solaris. Record the command used:
Command:___________________________________________________
17. Verify your network interfaces. Record the command used:
Command:___________________________________________________
18. Verify your LSB #s. Record the command used:
Command:___________________________________________________
19. Determine what SBs are connected and configured. Record the
command used:
Command:___________________________________________________
20. Halt your domain. Record the command used:
Command:___________________________________________________
21. Return to the service processor XSCF prompt. Record the command
used:
Command:___________________________________________________
22. Power off domain 0. Record the command used:
Command:___________________________________________________
23. Delete XSB 00-0. Record the command used:
Command:___________________________________________________
24. Remove LSB #0. Record the command used:
Command:___________________________________________________
Command:___________________________________________________
13. Determine what SBs are connected and configured. Record the
command used:
Command:___________________________________________________
14. Halt your domain. Record the command used:
Command:___________________________________________________
15. Return to the service processor XSCF prompt. Record the command
used:
Command:___________________________________________________
16. Power off domain 0. Record the command used:
Command:___________________________________________________
17. Delete XSB 00-0. Record the command used:
Command:___________________________________________________
18. Remove LSB #0. Record the command used:
Command:___________________________________________________
Exercise Summary
Exercise Solutions
This section provides solutions to the exercises.
PSB#00
CPUM#0-CHIP#0
CPUM#0-CHIP#1
CPUM#1-CHIP#0
CPUM#1-CHIP#1
MEMB#0
MEMB#1
MEMB#2
MEMB#3
IOU#0
PSB#01
CPUM#2-CHIP#0
CPUM#2-CHIP#1
CPUM#3-CHIP#0
CPUM#3-CHIP#1
MEMB#4
MEMB#5
MEMB#6
MEMB#7
IOU#1
14. At the obp, determine how many processors your domain has.
Record the command used:
Command: show-devs
15. How many processors you have for domain 0? Record the command
used:
Command: four, show-devs
16. Boot to Solaris. Record the command used:
Command: boot
17. Verify your network interfaces. Record the command used:
Command: ifconfig -a
18. Verify your LSB #s. Record the command used:
Command: prtdiag -v
19. Determine what SBs are connected and configured. Record the
command used:
Command: cfgadm -a
20. Halt your domain. Record the command used:
Command: init 0
21. Return to the service processor XSCF prompt. Record the command
used:
Command: #.
22. Power off domain 0. Record the command used:
Command: poweroff -d 0 -y
23. Delete XSB 00-0. Record the command used:
Command: deleteboard -c unassign 00-0
24. Remove LSB #0. Record the command used:
Command: setdcl -d 0 -r 0
Command: prtdiag -v
13. Determine what SBs are connected and configured. Record the
command used:
Command: cfgadm -a
14. Halt your domain. Record the command used:
Command: init 0
15. Return to the service processor XSCF prompt. Record the command
used:
Command: #.
16. Power off domain 0. Record the command used:
Command: poweroff -d 0 -y
17. Delete XSB 00-0. Record the command used:
Command: deleteboard -c unassign 00--0
18. Remove LSB #0. Record the command used:
Command: setdcl -d 0 -r 0
Module 6
Objectives
Upon completion of this module, you should be able to:
Document the purpose and benefits of Dynamic Reconfiguration
(DR)
Analyze the locations from which DR can be performed
Discuss the concepts of DR in preparation for attaching and
detaching XSBs
Describe I/O restrictions and concerns in regards to DR
Describe memory restrictions and concerns in regards to DR
Describe CPU restrictions and concerns in regards to DR
Describe Extended System Board (XSB) considerations and directives
Perform DR using the Service Processor to add or remove an XSB
from a domain
Relevance
Additional Resources
Note The MX000 servers XSBs may consist of CPU, Memory, and I/O.
Its important to know your domain configuration including expected
service levels before performing the removal of an XSB.
Benefits of DR
Dynamic Reconfiguration allows you to:
Dynamically configure XSBs into domains
Dynamically unassign XSBs from domains
Dynamically move XSBs between domains
Display XSB status
Hot-plug individual PCI cards from a domain using the Solaris
cfgadm command
DR Operational Locations
DR operations can be performed from two locations:
From the service processor
From the domain
Domain
The cfgadm command can be run directly from the domain to get
resource status and to dynamically reconfigure PCI adapters only. The
cfgadm command was first introduced for the previous Sun Enterprise
server line of platforms.
Note In the MX000 server line, the cfgadm command does not support
DR operations for XSBs.
DR Concepts
DR lets you perform maintenance on an active domain by disconnecting
and then reconnecting XSBs without bringing the domain down. In
addition to maintenance on the high-end servers, DR also provides
enhanced performance by allowing XSB addition (thus more processors,
memory, and I/O) at periods when resource requirements are high.
The same applies to network controllers. The XSB that hosts the primary
network interface that connects to the domain cannot be detached unless
multiple network paths exist to the same network using another XSB in
the same domain.
In addition to planning for the XSB resource changes, you must also make
sure that the XSB is in the domain component list for whichever domain
or domains the XSB will be added to or removed from. This can be
verified from the output of the showdcl -a command. If a XSB needs to
be added to a domains component list prior to a DR operation, use the
setdcl command to do this.
DR attach Operation
The DR attach operation adds the specified XSB to a domain running the
Solaris OS. When the XSB is added and the XSBs resources and interfaces
are configured to the domain, the domain uses them without any
difficulty. The steps for an attach operation include:
Assign the XSB.
Configure the XSB.
DR attach States
When performing a DR attach operation, two levels of state change are
supported:
Assign When assign is specified, the DR software reserves the XSB
for the specified domain. At this point the XSB cannot be configured
in or assigned to other domains.
The assigned XSB is configured in the domain by a reboot or
execution of the addboard command with the configure option.
Reserve When reserve is specified, the DR software reserves the
XSB for the specified domain. At this point the XSB cannot be
configured in or assigned to other domains.
The assigned XSB is configured in the domain by a reboot or
execution of the addboard command with the configure option.
Configure When configure is specified, the DR software performs
hardware-specific operations that allow an XSBs resources for use
by the Solaris OS. XSBs that are configured are part of the domain
configuration and are available for manipulation by the Solaris OS
software device manipulation maintenance commands, such as
psradm, mount, and ifconfig.
When the configure operation is complete, the XSB in question is
recognized by the Solaris OS.
If you are attaching an XSB with I/O devices you might still have to run
the devfsadm command manually to create new device files.
DR detach Operation
A DR detach operation removes the specified XSB from a domain, which
must be already running the Solaris OS.
DR detach States
When performing a DR detach operation, three levels of state change are
supported:
Unassign When unassign is specified, the DR software completely
disconnects the XSB from the domain configuration and puts it in the
available system pool where it can be incorporated into other
domain configurations.
Disconnect This changes the receptacle state to disconnected. If the
occupant state is configured, the disconnect function first attempts to
unconfigure the XSB. The disconnect function powers the XSB off by
default. The disconnect function powers off a PCI Card which can
now be removed from its PCI slot. Because the XSB remains
assigned to the domain, it can be configured again by a domain
reboot or the execution of the addboard command.
Reserve When reserve is specified, the DR software does not
immediately disconnect the XSB from the domain configuration but
only reserves detachment. When the domain is powered off or
rebooted, the reserved XSB is disconnected and unassigned from the
domain and the XSB is placed in the system pool.
Both Sun StorageTek Traffic Manager software and IPMP are key
components of DR operations on XSBs with I/O components. If the
domains vital system resources are multipath configured, then a DR
operation can be performed on one of the physical paths without
interrupting access to the device.
If the driver does not detach or suspend, you must unload the device
driver using the modunload command.
When you have identified the driver to remove from memory, run the
modunload command to delete it from the kernel. Use the modinfo
command to get the device drivers ID number. It is not always the same
and is not the driver major number from the /etc/name_to_major file.
1. Run the modinfo command to get the driver ID. The driver ID is the
first number in the modinfo command output; in this case, 307.
# modinfo | grep -i joe
307 f66a0000 dfe9 33 1 jn (Joes Non-compliant driver, v. 0.6)
2. Run the modunload command, specifying the driver number
from the modinfo command.
# modunload -i 307
Note By default, the kernel is caged on the XSB with the lowest LSB #.
Quiescence
Note Permanent memory can only be relocated to another XSB with the
same amount of memory or a greater amount of memory.
Swap Space
Configure system swap space as multiple partitions or files on disks
attached to controllers owned by different XSBs. This allows any swap
partition or file to be easily replaced with the swap command. It also has a
side benefit of providing better swap performance because any swap load
is spread over several I/O controllers.
The amount of additional swap space that you need is equal to the
amount of main storage on two domain XSBs. To be able to handle every
case, you must plan to use the largest memory amount on any domain
XSB.
Depending on how short of swap space you are, the DR operation might
fill all available swap space and hang the DR operation.
Ensure that you have enough space in the new primary swap partition
(and in the /var directory) to contain a full domain panic dump.
Note DR does not verify whether enough swap space is available before
starting the detach operation. You can use DR to determine how much
memory must be drained from the XSB and then use the swap -l
command to determine if the current amount of system swap space is
sufficient.
CPU Considerations
Two conditions can make it impossible for you to detach an XSB related
to the CPUs themselves:
A running process has threads bound to a CPU being removed.
A processor being removed is the last in its processor set.
Bound Threads
Process threads can be programatically or administratively bound to a
particular CPU. That is, they run on only that particular CPU. A CPU that
has any bound threads cannot be removed through a DR operation.
You can use the pbind command to list any processes that have bound
threads and to unbind those threads if you need to remove a particular
CPU:
# pbind
process id 40848: 0
# pbind -u 40848
process id 40848: was 0, now not bound
Processor Sets
Threads can be bound to processor sets rather than individual processors.
By default, no processor sets exist, but they can be created by the root
user.
The default policy is fru. If any resource for the XSB fails post, then the
domain will not be able to use that resource.
If you change the value of the policy to xsb, any post failure of an XSB
resource causes the entire XSB to be unusable.
If you change the value of the policy to system, any failure during post
causes the entire domain to fail post.
The floating board option allows you to control if an XSB can have
permanent memory.
Note An XSB with the float option set to true can never hold the kernel
cage. The only exception to the rule is: If all XSBs are set to float (static
domain creation), the kernel must get caged somewhere.
The value of this option can be set to true to enable the floating board
setting or false to disable it with setdcl -s float=value LSB#. The
default value is false. An XSB that has this option set to true is called a
floating board.
Omit-Memory Option
When the omit-memory option is enabled, the memory on an XSB cannot
be used in the domain. This option can be used when the domain needs
the XSBs CPU resources and not the memory. It will make the attach
operation much faster because the XSBs memory will not be tested.
Note This option will not allow unmounted XSBs with no associated
memory to be used. The associated memory must exist to be omitted.
The value of this option can be set true to omit memory or false to not
omit memory on the XSB with the setdcl -s no-mem=value LSB#. The
default value is false.
Omit-I/O Option
The omit-I/O option disables the PCI cards, disk drives, and network
ports on an XSB to prevent the target domain from using them. You set
the omit-I/O option to true when the target domain only needs to use
the XSBs CPU and memory. Having the option set to false allows the
target domain to use all of the I/O resources on the XSB.
The value of this option can be set to true to omit the I/O units or false
to not omit them with setdcl -s no-io=value LSB#. The default value
is false.
To verify the XSB settings and configuration, use the showdcl command
with the verbose option.
XSCF> showdcl -av
DID LSB XSB Status No-Mem No-IO Float Cfg-policy
00 Powered Off FRU
00 00-0 False False False
01 00-1 True True True
02 01-0 True True True
03 01-1 True True True
XSCF Commands
Before a DR operation can be performed, the administrator must have the
proper privileges to perform the DR operation. The following commands
perform a DR operation from the XSCF softwares CLI:
The addboard command Dynamically configures, attaches, and
incorporates an XSB into the specified domain.
The deleteboard command Dynamically unconfigures, detaches,
and removes an XSB from the specified domain.
The moveboard command Dynamically unconfigures, detaches,
and removes an XSB from the specified domain and dynamically
configures, attaches, and incorporates this XSB into another specified
domain.
The showdcl and setdcl commands Displays and allows you to
set the current domain component list. XSBs must be in a domains
DCL prior to executing a DR operation.
The showdomainstatus command Displays the domains current
status and should be used both before and after DR operations.
The showboards command Displays XSB information and should
be used both before and after DR operations.
The showdevices command Displays device information
including CPUs, memory, and PCI card information and should be
used before a DR detach operation.
The showfru and setupfru commands Displays and allows you to
modify XSB configuration. They should be used before DR
operations if you need to change the XSBs configuration.
In the following example, XSB 01-0 is in the system pool (SP) and is not in
the domain component list for any domain.
XSCF> showboards -a
XSB DID(LSB) Assignment Pwr Conn Conf Test Fault
---- -------- ----------- ---- ---- ---- ------- --------
00-0 00(00) Assigned y y y Passed Normal
01-0 SP Unavailable n n n Unknown Normal
XSCF> showdcl -a
DID LSB XSB Status
00 Running
00 00-0
In the next set of examples, XSB 01-0 is added to the domain component
list for domain 0 and its status is verified.
XSCF> setdcl -d 0 -a 1=01-0
XSCF> showdcl -a
DID LSB XSB Status
00 Running
00 00-0
01 01-0
The XSB now has an LSB # but is still not allocated to domain 0.
XSCF> showboards -a
XSB DID(LSB) Assignment Pwr Conn Conf Test Fault
---- -------- ----------- ---- ---- ---- ------- --------
00-0 00(00) Assigned y y y Passed Normal
01-0 SP Unavailable n n n Unknown Normal
In this next example: XSB 01-0 is currently in the system pool and is not
yet assigned to domain 0. When you execute the addboard command, the
configure function also performs the assign function if it has not been
performed yet.
XSCF> addboard -d 0 -c configure 01-0
XSB#01-0 will be configured into DomainID 0. Continue?[y|n] :y
Initial diagnosis started. [1800sec]
0..... 30..... 60..... 90.....120.....150.....180.....210.....end
Initial diagnosis has completed.
Start configuring XSB to domain.
Configured XSB to domain.
XSCF> showboards -a
XSB DID(LSB) Assignment Pwr Conn Conf Test Fault
---- -------- ----------- ---- ---- ---- ------- --------
00-0 00(00) Assigned y y y Passed Normal
01-0 00(01) Assigned y y y Passed Normal
XSCF> showboards -a
XSB DID(LSB) Assignment Pwr Conn Conf Test Fault
---- -------- ----------- ---- ---- ---- ------- --------
00-0 00(00) Assigned y y y Passed Normal
01-0 SP Available y n n Passed Normal
XSCF> showdcl -a
DID LSB XSB Status
00 Running
00 00-0
01 01-0
To move an XSB between domains, the XSB needs to be part of the DCL
for both the source and destination domains. Below, the XSB is being
added to the DCL for domain 1. Notice that domain 1 (DID 01) is
currently powered off.
XSCF> showdcl -a
DID LSB XSB Status
00 Running
00 00-0
01 01-0
---------------------------
01 Powered Off
00 01-0
Next, the XSB is moved out of its current domain (DID 00) and is assigned
to domain 1.
XSCF> moveboard -d 1 -c assign 01-0
XSB#01-0 will be assigned to DomainID 1 immediately. Continue?[y|n] :y
Start unconfiguring XSB from domain.
Unconfigured XSB from domain.
XSB power off sequence started. [1200sec]
0.end
Operation has completed.
You can see in the showboards output that XSB 01-0 is now assigned to
domain 1 (DID 01) but it is not yet connected or configured.
XSCF> showboards -a
XSB DID(LSB) Assignment Pwr Conn Conf Test Fault
---- -------- ----------- ---- ---- ---- ------- --------
00-0 00(00) Assigned y y y Passed Normal
01-0 01(00) Assigned y n n Passed Normal
Preparation
You will need to know the current system configuration. If your domain
is built with a uni-mode XSB, halt the domain, power off the domain, and
remove the XSB from the domain. Remove all LSB entries that exist.
Exercise Summary
Exercise Solutions
This section contains solutions for the exercise.
11. Reserve XSB 00-1 for detachment. Record the command used:
Command: deleteboard -c reserve 00-1
12. Did the reserve of XSB 00-1 succeed? Record the command used to
verify the reservation:
Command: showboards -av
13. Remove XSB 00-1 from domain 0 and assign XSB 00-1 to domain 1
without halting Solaris. Record the command used:
Command: deleteboard -c unassign 00-1
and
Command: addboard -d 1 -c assign 00-1
or
Command: moveboard -d 1 -c assign 00-1
14. Verify XSB 00-0 and 00-1 are assigned to two different domains.
Record the command used:
Command: showboards -a
15. Remove XSB 00-1 from domain 1 and configure XSB 00-1 into
domain 0. Record the command used:
Command: deleteboard -c unassign 00-1
and
Command: addboard -d 0 -c configure 00-1
or
Command: moveboard -d 0 -c configure 00-1
16. Verify XSB 00-0 and 00-1 are assigned to the same domain. Record
the command used:
Command: showboards -a
Module 7
Objectives
Upon completion of this module, you should be able to:
Configure log archiving for the service processor
Collect log data for the service processor
Configure auditing for the service processor
Collect audit data for the service processor
Collect log and audit data using the snapshot command
View logs on the service processor
Collect fault data from the service processor
List current FRU replacement guidelines
Use Sun Explorer scripts to gather domain information
Relevance
Additional Resources
Basic Features
Log archiving provides the following features:
Provides a remote archive host to be used as a backing store for
service processor logs
When enabled, copies existing log data to the archive host
Continuously archives new log data
Encrypts all connections established between the service processor
and archive host
When enabled, log archiving periodically uses the secure copy program
(scp) to transfer new log data to the archive host. Log archiving also uses
secure shell (ssh) to monitor the disk space consumed by archives. It
deletes older archives when necessary, so that the space consumed by the
archives never exceeds the user-configurable archive space limits.
Note For security reasons, log archiving does not automatically delete
audit log archives. You can, however, still manually delete audit log
archives that are no longer needed.
Encryption
All connections established through log archiving are encrypted. The log
archiving feature provides the ability to use an RSA public key to
authenticate the archive host. The public key is configured and managed
on the service processor.
Note After tests indicate that the archive host is set up correctly, log
archiving is enabled effective immediately. If the tests fail, you receive an
error message that log archiving was not enabled.
Auditing
An MX000 server logs all service processor events that could be relevant
to security, such as system startup and shutdown, user login and logout,
and privilege changes. These events are tracked and implemented
through:
Audit records
Audit trails
Audit events
Audit classes
Audit policy
Audit Records
Audit files are stored in binary format, although you can export them to
XML. The audit file system switches storage between two partitions.
Audit records are stored in one partition until it becomes full, then new
records are stored in the other partition. Records in a full partition can be
moved to a remote location, depending on the audit policy.
Audit Trails
A collection of audit records that are linked is called an audit trail. An
audit trail can reveal suspicious or abnormal patterns of system behavior,
in addition to identifying which user was responsible for a particular
event.
Audit Events
Audit events can be any of the following types:
Changes to the service processor configuration, such as the IP
address
Requests to perform an operation on an object protected by the
access control policy
Tests of password strength
Modifications to the access control attributes associated with an
object for example, which domain a board is associated with
Changes made to user security attributes for example, password or
privileges
Reading information from the audit records (including unsuccessful
attempts)
Modifications to the audit policy
Actions taken caused by exceeding the audit trail size threshold
Actions taken due to audit storage failure
Modifications made by administrators to the audit trail
Changes to the system time
Recorded Data
Audit Classes
Audit classes are categories for grouping and sorting audit events. The
MX000 servers provide a predefined set of audit classes for example,
login events and service-related events. You cannot define additional
audit classes or change the events in a class.
Note Refer to the setaudit man page for a list of audit classes.
Audit Policy
The audit policy determines how the auditing feature is implemented by
configuring the follows variables:
Whether auditing is enabled or disabled
The types of events that are audited
Which users have their events audited
Which remote directories are used for storing audit records
The threshold of local capacity at which a warning is issued
What action to take when both audit partitions are full
The default audit policy that is configured on the service processor is:
Auditing is enabled.
Records are dropped and counted when the audit trail is full.
All events are enabled for auditing.
The global user audit policy is enabled.
Per-user audit policy for all users is enabled.
Audit warning thresholds are set at 80 percent and 100 percent full.
Email warnings are disabled.
The following example changes the audit recording for various classes:
XSCF> setaudit -c LOGIN,AUDIT=disable
Auditing for LOGIN and AUDIT classes has been disabled.
The following example displays audit records for February 19, 2009:
XSCF> viewaudit -D 20090219
file,1,2009-02-19 11:12:24.528 -04:00,20090219151224.0000000000.kong-sp0
The following example displays user audit records for a user named sam:
XSCF> viewaudit -u sam
file,1,2009-02-19 11:12:24.528 -05:00,20090219151224.0000000000.kong-sp0
header,37,1,login - telnet,kong-sp0,2009-02-19 11:31:09.659 -05:00
subject,1,sam,normal,ssh 45880 kong-sp0
command,showuser
platform access,granted
return,0
snapshot -h
Operational Overview
The snapshot command transfers collected data to a specified
destination. This is done by:
1. Opening an output file, the name of which is automatically
generated based on the service processor host name, service
processor IP address, and the UTC time (in hours, minutes, and
seconds) and date at the time the snapshot is invoked. For example:
kong-sp0_10.6.15.41_2009-01-31T01-08-36.zip
Modes of Operation
The snapshot command can be operated in one of the following modes:
SSH target mode
USB device mode
The first mode is SSH target mode. The data collector runs in this mode
when it is invoked with the -t or -T option. In this mode, the data
collector opens an SSH connection from the service processor to the
specified target (after appropriate authentication) and sends the zipped
data archive through the SSH connection to the target host. The
transmission encryption in this mode is provided by SSH.
The second mode is USB device mode. The data collector runs in this
mode when it is invoked with the -d flag. In this mode, the data
collectors output (which is the zip archive) is saved in a file on a USB
device. No transmission encryption occurs in this mode because the data
stays local to the service processor.
The following example displays an error log in detail for the times of the
specified timestamp (-v).
XSCF> showlogs error -p Feb2211:50:462009 -v
Date: Feb 22 11:50:46 EST 2009 Code: 60000500-ffff0000-
0300000800030000
Status: Warning Occurred: Feb 22 11:50:45.621 EST 2009
FRU: /UNSPECIFIED
Msg: Externally initiated reset occurred
Diagnostic Code:
ffffffff ffff0000 00000000
58495200 00000000 00000000 00000000
00000000 00000000 00000000 00000000
UUID: 2aa9e7b7-ed1c-44eb-ba93-9d7ba1f2c6fd MSG-ID: SCF-8000-P5
The following example displays a power log for the date range for
February 22, 2009 from 1:26 PM to 1:27 PM:
You must have one of the following privileges to run this command:
platadm, platop, or fieldeng.
The FMA implementation on the service processor includes the use of the
following commands to collect and view fault data:
fmadm Views and configures system parameters
fmdump Displays the list of faults detected by the FMA
fmstat Displays statistical information about faults
The fmadm utility can be used to view the resources on the system
believed to be faulty. The fmstat utility can be used to report statistics
kept by Fault Manager.
When possible, the Fault Manager daemon initiates steps to self-heal the
failed component and take the component offline. The daemon also logs
the fault to the syslog daemon and provides a fault notification with a
message ID (MSGID).
Note You can use the message ID to get additional information about
the problem from Sun's knowledge article database at:
http://www.sun.com/msg/
The options and parameters for the fmadm command are shown in
Table 7-2.
The following example displays the fault report for the specified UUID.
Find the domain id number in the following output and notice the state of
the maintenance-action-required output:
XSCF> fmdump -V -u 2aa9e7b7-ed1c-44eb-ba93-9d7ba1f2c6fd
TIME UUID MSG-ID
Feb 22 11:50:46.7130 2aa9e7b7-ed1c-44eb-ba93-9d7ba1f2c6fd SCF-8000-P5
nvlist version: 0
version = 0x0
class = list.suspect
uuid = 2aa9e7b7-ed1c-44eb-ba93-9d7ba1f2c6fd
code = SCF-8000-P5
diag-time = 1203699046 712677
de = (embedded nvlist)
nvlist version: 0
version = 0x0
scheme = fmd
authority = (embedded nvlist)
nvlist version: 0
version = 0x0
product-id = SPARC Enterprise M5000
chassis-id = BCF0710015
server-id = kong-sp0
(end authority)
mod-name = sde
mod-version = 1.15
(end de)
fault-list-sz = 0x1
fault-list = (array of embedded nvlists)
(start fault-list[0])
nvlist version: 0
version = 0x0
class = upset.chassis.SPARC-Enterprise.cpu.SPARC64-VI.fe
certainty = 0x64
fru = (embedded nvlist)
nvlist version: 0
version = 0x0
scheme = hc
hc-root =
scf_error_log = 0000000000000000
detected-by = OpenBoot PROM
maintenance-action-required = false
location = CHASSIS
(end fault-list[0])
fault-status = 0x0
__ttl = 0x1
__tod = 0x47befd66 0x2a7fa768
The following example displays part of the contents of the MX000 Fault
Manager messages:
XSCF> fmdump -m -M
MSG-ID: SCF-8000-P5, TYPE: Upset, VER: 1, SEVERITY: Critical
EVENT-TIME: Fri Feb 22 11:50:46 EST 2009
PLATFORM: SPARC Enterprise M5000, CSN: BCF0710015, HOSTNAME: kong-sp0
SOURCE: sde, REV: 1.15
EVENT-ID: 2aa9e7b7-ed1c-44eb-ba93-9d7ba1f2c6fd
DESC: A hard-to-diagnose problem was detected by the CPU chip.
Refer to http://www.sun.com/msg/SCF-8000-P5 for more information.
AUTO-RESPONSE: Nothing is deconfigured.
IMPACT: The domain may drop to the OpenBoot PROM 'OK' prompt.
REC-ACTION: The platform administrator should check to see if any other
hardware failure has occurred on the platform.
If the problem occurs while running OpenBoot PROM, then the platform
administrator should check to make sure that
the OpenBoot PROM environment variables have not been set incorrectly. If
neither of these courses of action
provide a resolution, then the platform administrator should schedule a
service action.
Please consult the detail section of the knowledge article for additional
information.
The fmdump command with the -e (error) option is useful tool that will
show the error messages for problems that has occurred. The error
message itself might not be enough to determine the problem. The
timestamps from fmdump -e will match the timestamps from fmdump.
Use fmdump to discover the UUID and then run fmdump -v -u UUID
for detailed information.
XSCF> fmdump -e
TIME CLASS
Jan 30 18:32:46.2312 ereport.chassis.SPARC-Enterprise.xscfu.process-down
Jan 30 20:52:13.2032 ereport.chassis.SPARC-Enterprise.xscfu.process-down
If the -m option is not present, fmstat reports the following statistics for
each of its client modules:
If there seems to be one suspected FRU based on the fault data collected:
If this is the first occurrence of the fault within a 30-day period, then
replace the first suspected FRU on the list.
If this is the second occurrence of the same fault within a 30-day
period, then check for additional information because there is a
slight probability of another faulty component contributing to this
fault.
If this is the second occurrence of the same fault after a 30-day
period, then replace the first suspect on the list.
If there seems to be more than one suspected FRU based on the fault data
collected:
If this is the first occurrence of the fault within a 30-day period, then
replace the first suspect on the list.
If this is the second occurrence of the same fault within a 30-day
period, then replace the second suspect on the list.
If this is the third occurrence of the same fault within a 30-day
period, then replace the third suspect on the list if one exists, or
check for additional information because there is a slight probability
of another faulty component contributing to this fault.
If this is the second occurrence of the same fault after a 30-day
period, then replace the first suspect on the list.
Note The dumpconfig command will not save service processor log
information.
Caution The following commands should not be done when local access
to the service processor is not possible.
Caution The instructor does not have remote power control to resolve
power issues.
Caution Running the following commands will END your lab for the
rest of the class.
Caution The following commands should not be done when local access
to the service processor is not possible.
Caution The instructor does not have remote power control to resolve
power issues.
Caution Running the following commands will END your lab for the
rest of the class.
The first mode, xscfu, allows for log, audit, and fault information to be
removed.
The second mode of operation, factory, returns the service processor to its
original unconfigured state. Log, audit, and fault information is removed
as well as any configuration for the service processor; this includes
network and domain information.
Continue?[yes/no](default no):no
If you answer "yes" this command will HALT the XSCFU when it completes.
You will need to power cycle the system after the XSCF BOOT STOP.
Continue?[yes/no](default no):no
The initialization of XSCF will be started.
XSCFU : all data clear
OPNL : not clear
XSCF will be automatically rebooted. Afterwards, XSCF will be
initialized.
Continue?[yes/no](default no):no
Continue?[yes/no](default no):no
You must check the following points.
If you answer "yes" this command will HALT the XSCFU when it completes.
You will need to power cycle the system after the XSCF BOOT STOP.
Continue?[yes/no](default no):no
The initialization of XSCF will be started.
XSCFU : all data clear
OPNL : all data clear (exclude SYSTEM ID data)
XSCF will be automatically rebooted. Afterwards, XSCF will be
initialized.
Continue?[yes/no](default no):no
You can get Sun Explorer software through the SunSolveSM Online
service.
Note One of the most important aspects of Sun Explorer software is the
proper entry of information during installation. Sun Explorer software
asks a series of questions about the system serial number, contract ID, and
company name. The answers to these questions should be entered
properly to avoid confusion.
5. Use the following command when running the Sun Explorer utility
for the first time in a domain:
# /opt/SUNWexplo/bin/explorer -g
6. Use the following command to run the Sun Explorer utility:
# /opt/SUNWexplo/bin/explorer -w default,xscfextended
Note This process places the snapshot in the /tmp directory of the
archive host.
The following example shows the contents of the README file found in the
Sun Explorer directory.
# cat README
=========== SUN(TM) EXPLORER DATA COLLECTOR (Version 5.13) ===========
=========== Patch level:
Synopsis
========
Customer Name: Sun
Contract ID: Gary
Email results to: gary.r@sun.com
Subject: explorer.843a9ae1.kong-dom0-2009.02.23.06.17
Modules selected: default,xscfextended
Command Line: /opt/SUNWexplo/bin/explorer -w default,xscfextended
License Agreement: Sun Feb 22 20:08:11 GMT 2009
The following info was collected using UNIX tools (hostname, arch, etc)
----------------------------------------------------------------------
Date: 2009.02.23.06.17
Finished: 2009.02.23.06.21
Hostname: kong-dom0
Hostid: 843a9ae1
Release: 5.10
Kernel architecture: sun4u
Application architecture: sparc
Domain:
Kernel version: SunOS 5.10 Generic 137137-09
System Type: SUNW,SPARC-Enterprise
Uptime: 1:17am up 9:28, 1 user, load average: 0.29, 0.06, 0.02
Swap-s:
total: 193656k bytes allocated + 30592k reserved = 224248k used, 31011808k
available
Swap-l:
swapfile dev swaplo blocks free
/dev/dsk/c0t0d0s1 32,1 16 34801904 34801904
Vmstat:
kthr memory page disk faults cpu
r b w swap free re mf pi po fr de sr s0 s1 s3 s4 in sy cs us sy id
0 0 0 30689144 15284992 8 28 5 0 0 0 12 1 -0 1 0 337 174 129 0 0 100
#
Note For more detailed information regarding the use of Sun Explorer,
see the Sun Explorer Frequently Asked Questions located at:
http://docs.sun.com/app/docs/doc/819-6614
Note This exercise has been written with the assumption that you have
remote access to an MX000 server.
Preparation
To complete this exercise, you need your assigned non-global zones
hostname and the network configuration.
Exercise Summary
Exercise Solutions
This section provides answers to the exercises.