AIX 5.3
Virtualization Updates & Tools
mpstat Example
CPU user & sys values are relative to physical processor consumed
# mpstat -s
proc  Physical Processor / Virtual Processor busy. With SMT enabled, each physical processor has two logical processors.
cpu   Logical CPU number and the overall busy percentage, which is the sum of user + system mode utilization. Gives the relative SMT split between processors.
lparstat Review
root@sq09.dfw.ibm.com /tmp # lparstat -h 1 4
System configuration: type=Shared mode=Capped smt=On lcpu=4 mem=4096 psize=2 ent=0.40
%user %sys %wait %idle physc %entc lbusy  app vcsw phint %hypv hcalls
----- ---- ----- ----- ----- ----- ----- ---- ---- ----- ----- ------
 84.9  2.0   0.2  12.9  0.40  99.9  27.5 1.59  521     2  13.5   2093
 86.5  0.3   0.0  13.1  0.40  99.9  25.0 1.59  518     1  13.1    490
Additional information (%hypv, hcalls) is shown when the “-h” flag is specified.
%user / %sys / %wait / %idle Shows the percentage of the entitled processing capacity used. In this example the system
is consuming 86.9% (84.9 + 2.0) of four-tenths of a physical processor. For dedicated partitions, the entitled capacity
equals the number of physical processors.
physc Shows the number of physical processors consumed. For a capped partition this number will not exceed the entitled
capacity. For an uncapped partition this number could match the number of processors in the shared pool; however,
it may be limited by the number of online Virtual Processors.
%entc Shows the percentage of entitled capacity consumed. For a capped partition the percentage will not exceed 100%;
however, for uncapped partitions the percentage can exceed 100%.
lbusy Shows the percentage of logical processor utilization that occurs while executing in user and system mode. Note: In
this example we’re using approx. 25% of the logical processors. This is the “traditional” measure of CPU utilization
using time-based sampling. As this value approaches 100% it may indicate that the partition could make use of
additional VPs.
[Shared Mode Only]
app Shows the number of available processors in the shared pool. The shared pool ‘psize’ is 2 processors. The partition
must have ‘Allow shared processor pool utilization authority’ set: view the “properties” for the partition and click the
Hardware tab, then Processors and Memory.
vcsw Shows the number of virtual context switches.
phint Shows the number of phantom interrupts. A phantom interrupt is an interrupt that belongs to another shared partition.
%hypv / hcalls Shows the percentage of time spent in the hypervisor and the number of hypervisor calls.
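The arithmetic behind the first sample row can be sketched in a few lines of Python. This is an illustration only, not part of the original slides: the function names are hypothetical, and the sample line and the entitlement of 0.40 are copied from the lparstat output shown earlier.

```python
# Column order as printed by `lparstat -h` in the sample output.
COLUMNS = ["%user", "%sys", "%wait", "%idle", "physc", "%entc",
           "lbusy", "app", "vcsw", "phint", "%hypv", "hcalls"]

def parse_row(line):
    """Map one data line of lparstat -h output onto its column names."""
    values = [float(tok) for tok in line.split()]
    if len(values) != len(COLUMNS):
        raise ValueError("unexpected number of columns")
    return dict(zip(COLUMNS, values))

def busy_processors(row, entitlement):
    """Physical processors spent in user + system mode.

    %user and %sys are percentages of the entitled capacity,
    so the busy share is scaled by the entitlement.
    """
    busy_pct = row["%user"] + row["%sys"]
    return busy_pct / 100.0 * entitlement

row = parse_row("84.9 2.0 0.2 12.9 0.40 99.9 27.5 1.59 521 2 13.5 2093")
print(round(row["%user"] + row["%sys"], 1))   # busy percentage of the entitlement
print(round(busy_processors(row, 0.40), 4))   # the same load in physical processors
```

With ent=0.40, the 86.9% busy figure corresponds to roughly a third of one physical processor, which is why physc sits at the 0.40 cap while lbusy stays near 25%.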
* Can adjust cpu cycles required for other processors by computing a ratio between processor speeds
I/O Tuning – iostat -D
Service times you could only get from filemon before
------------------------------------------------------------------------
Detailed Physical Volume Stats (512 byte blocks)
------------------------------------------------------------------------
Virtual Ethernet
Packets transferred in memory between partitions on the same
server
– Higher throughput than physical ethernet
– Physical devices do not support MTU 65394
Throughput linearly scales with processor entitlements
– MTU 9000 is 3X MTU 1500
– MTU 65394 is 7X MTU 9000
– Try to use the highest MTU
No unique TCP/IP tunables methodology
TCP Checksum Offloading
– Because virtual network does not suffer from physical network
link errors, checksums do not need to be generated (this is the
default in later AIX 5.3 levels)
• # chdev -l <device> -a chksum_offload=yes
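The MTU ratios quoted above chain together, which is easier to see when written out. The sketch below takes the slide's factors at face value (they are not measured here) and expresses each MTU relative to MTU 1500; the names are hypothetical.

```python
# Relative throughput factors as quoted on the slide (not measured here).
STEP_GAINS = [
    (1500, 1.0),    # baseline
    (9000, 3.0),    # "MTU 9000 is 3X MTU 1500"
    (65394, 7.0),   # "MTU 65394 is 7X MTU 9000"
]

def relative_throughput():
    """Return {mtu: throughput relative to MTU 1500} by chaining the step gains."""
    result = {}
    factor = 1.0
    for mtu, gain in STEP_GAINS:
        factor *= gain
        result[mtu] = factor
    return result

print(relative_throughput())
```

Under these figures MTU 65394 works out to about 21 times the MTU 1500 throughput, which is the reasoning behind "try to use the highest MTU".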
Shared Ethernet
Heavy network load, use same sizings as dedicated systems
– MTU 1500, 1 CPU
– MTU 9000, 0.5 CPU
Shared processors
– Shared processors can result in higher latency, decreasing throughput
– For bursty network loads, use uncapped and allow for more entitlement than
would be allocated for a dedicated partition hosting the same application
Tools
• lsattr -El en#
• topas
• entstat, netstat
• seastat
– Tool from Nigel Griffiths simplifies output, provides intervals
– http://www-941.ibm.com/collaboration/wiki/display/WikiPtype/nmon
Whenever there is a VIO Client/Server issue, check first for a CPU constraint
– Add entitlement (shared) or increase CPUs (dedicated)
– Use larger MTU sizes if possible
Tools for LPAR & CEC Historical Performance
Local Recording
Metrics
– CPU/LPAR
• User, system, idle, wait, logical/physical busy, available pool processor,
pool size
• Entitlement, entitlement consumed
• CPU counts: physical, logical, virtual
– Memory
• Real: size, %client, %comp, %noncomp, %pinned, # frames
• Virtual: file page in/out, page in/out, page faults, frames stolen, IO requests
initiated by VMM
– System Calls/IO
• Total, read, write, fork, exec, read/write bytes
– Global Processes
• Number of processes, run queue, swap queue, process context switches
• PLM supplied utilization and load average
• No per-process/thread metrics
Local Recording
Metrics (cont.)
– Disk
• Busy, read/write blocks, average wait time, average service time
– Network
• LAN: KB in/out, transmit drops, receive drops, frames in/out
• Network Interface: input/output packets, input/output KB
• UDP: send/receive datagrams, full input socket count
• TCP: send/receive packets, bytes, totals
– Filesystem
• Size, free, %used
– Paging Space
• Size, %free, total size, total free
– NFS
• Client/Server calls, NFS v2 & v3 only
– Workload Manager
• CPU, memory, disk consumed percentage per superclass
Local Recording
Where is the data?
– Agent records to /etc/perf/daily/ directory
– Recordings name format: xmwlm.YYMMDD
How long is the recording kept?
– Initially, each 24-hour ‘daily’ recording was retained for 48 hours
– Now supports retention of one week of daily recordings
• Last 7 days
• APAR IY91882
– Customers should generate reports daily or archive recordings
as their needs require
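For archiving scripts it helps to compute the expected recording names. The following is a minimal sketch, assuming the xmwlm.YYMMDD naming above and assuming the seven-day window includes the current day (the slides do not say); the function names are hypothetical.

```python
from datetime import date, timedelta

RECORDING_DIR = "/etc/perf/daily/"   # directory named on the slide

def recording_name(day):
    """Name of the daily recording for a given date (xmwlm.YYMMDD)."""
    return "xmwlm." + day.strftime("%y%m%d")

def retained_recordings(today, days=7):
    """Recording names expected on disk, newest first.

    Assumes `days` daily files are kept, counting today's file.
    """
    return [recording_name(today - timedelta(days=n)) for n in range(days)]

for name in retained_recordings(date(2007, 3, 5)):
    print(RECORDING_DIR + name)
```

Anything older than the last name in this list would have to be archived before the agent removes it.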
How much disk space does this take?
– Dependent upon the number of devices
• System with a low number of disks: 2 MB/day
• 10 MB/day for every 100 disks
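A rough sizing estimate can be derived from these two figures. The sketch below is one possible reading, not a formula from the slides: it assumes growth is linear at 10 MB/day per 100 disks, uses 2 MB/day as a floor for small systems, and multiplies by the seven-day retention.

```python
def daily_recording_mb(disk_count):
    """Rough MB/day: 10 MB per 100 disks, with a 2 MB floor (assumed)."""
    return max(2.0, 10.0 * disk_count / 100.0)

def retained_mb(disk_count, days_kept=7):
    """Approximate space for all retained daily recordings."""
    return daily_recording_mb(disk_count) * days_kept

for disks in (4, 100, 300):
    print(disks, daily_recording_mb(disks), retained_mb(disks))
```

For example, under these assumptions a system with 300 disks would need about 30 MB/day, or roughly 210 MB for a week of recordings.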
Report Types
– Formatted
• AIX 5.3 TL5
• ‘RMF-style’ text reports
• -R [ detailed | summary | disk | lan ] recording_file_name
• Output is written to standard out
– Redirect to customize file naming
Local Reports
Report Types (cont.)
– Nmon Analyzer
• AIX 5.3 TL5
• Format supported by Stephen Atkins' Excel utility
– Same tool that supports Nmon
– Breakdown and graphs by system resources
• topasout -a recording_file_name
• Output is written directly to a file
– Format: xmwlm.YYMMDD.csv
– Ready for Import into Analyzer, currently version 3.2.5
• Http://www-941.haw.ibm.com/collaboration/wiki/display/Wikiptype/nmonanalyser
• APAR IY87993
Local Reports
Report Types (cont.)
– Legacy PTX
• AIX 5.3 ML4
• Pre-existing formats common with Performance Toolbox
reports
• -c recording_file_name Comma-separated output
• -s recording_file_name Spreadsheet import
• Output is written directly to a file
– Format: xmwlm.YYMMDD_01
------------------------------------dedicated-----------------------------------
ptoolsl1 A53 S 4.1 0.5 4 20 10 0 70 0.60
ptoolsl4 A53 4.1 0.5 2 100 0 0 0 2.00
ptoolsl6 A52 4.1 0.5 1 5 5 12 88 0.10
•M – System Mode
•c means capped, C - capped with SMT
•u means uncapped, U - uncapped with SMT
•S means SMT
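The M column legend can be expressed as a small lookup. This is a sketch under stated assumptions: lower-case letters are read as "without SMT", "u" is read as uncapped, "S" is taken as a dedicated partition with SMT (it appears in the dedicated section above), and an empty flag is reported only as "no flag shown". The names are hypothetical.

```python
# (partition mode, SMT enabled) per flag, following the legend above.
MODE_FLAGS = {
    "c": ("capped", False),
    "C": ("capped", True),
    "u": ("uncapped", False),   # assumption: "u" denotes uncapped
    "U": ("uncapped", True),
    "S": ("dedicated", True),
}

def describe_mode(flag):
    """Return a readable description of a topas M-column flag."""
    flag = flag.strip()
    if not flag:
        return "no flag shown"
    if flag not in MODE_FLAGS:
        return "unknown flag " + repr(flag)
    mode, smt = MODE_FLAGS[flag]
    return mode + (" with SMT" if smt else " without SMT")

for flag in ("S", "", "C", "u"):
    print(repr(flag), "->", describe_mode(flag))
```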
© 2007 IBM Corporation
Advanced Technical Support, Americas
When?
– Command line: AIX 5.3 TL5
– SMIT Panels: AIX 5.3 TL6
Report Types
– Formatted
• -R [ detailed | summary ] recording_file_name
• Written to standard out, redirect to customize file naming
– Legacy PTX
• -c recording_file_name Comma-delimited format
• -s recording_file_name Spreadsheet format
• Output format: topas_cec.YYMMDD_01
• APAR IY93513 includes recent fixes
– No Nmon Analyzer format at this time
...
...
AIX TL06
• setup access to partitions not on local subnet
• generate reports
►to file
►to printer
►to stdout
SMIT panel entries:
Add Host to topas external subnet search file (Rsi.hosts)
List hosts in topas external subnet search file
Show current recordings status
Generate Report
Configure Recordings
•Tools updated
• lparstat, mpstat and sar
• topas and topasout reports
Barcelona 2006
What's new in AIX 5.3 © IBM Corporation 2007
$ lparstat -i
Node Name                       : va01
Partition Name                  : va
Partition Number                : 2
Type                            : Dedicated-SMT
Mode                            : Donating
Entitled Capacity               : 1.00
Partition Group-ID              : 32770
Shared Pool ID                  : -
Online Virtual CPUs             : 1
Maximum Virtual CPUs            : 1
Minimum Virtual CPUs            : 1
Online Memory                   : 800 MB
Maximum Memory                  : 1024 MB
Minimum Memory                  : 128 MB
Variable Capacity Weight        : -
Minimum Capacity                : 1.00
Maximum Capacity                : 1.00
Capacity Increment              : 1.00
Maximum Physical CPUs in system : 4
Active Physical CPUs in system  : 4
Active CPUs in Pool             : -
Unallocated Capacity            : -
Physical CPU Percentage         : 100.00%
Unallocated Weight              :

System configuration: type=Dedicated mode=Donating smt=On lcpu=2 mem=800

%user %sys %wait %idle physc    vcsw
----- ---- ----- ----- ----- -------
  0.1  0.4   0.0  99.5  0.68  670234
  0.0  0.2   0.0  99.8  0.68  670234
  0.0  0.2   0.0  99.8  0.68  670234

• %user, %sys, %wait and %idle stay relative to partition capacity, in this case one processor.
• physc shows actual physical processor consumption: number of physical processors minus donated and stolen cycles.
• vcsw: donation causes hardware context switches.
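Because lparstat -i prints one "Label : value" pair per line, the output is straightforward to consume from a script. The following is a minimal sketch with a hypothetical function name; the sample lines are taken from the output above, and treating "-" as "not applicable" is an assumption.

```python
def parse_lparstat_i(text):
    """Turn 'Label : value' lines into a dict; a value of '-' becomes None."""
    info = {}
    for line in text.splitlines():
        label, sep, value = line.partition(":")
        if not sep:
            continue            # skip lines without a label/value separator
        value = value.strip()
        info[label.strip()] = None if value == "-" else value
    return info

SAMPLE = """\
Type                     : Dedicated-SMT
Mode                     : Donating
Entitled Capacity        : 1.00
Online Memory            : 800 MB
Shared Pool ID           : -
"""

info = parse_lparstat_i(SAMPLE)
print(info["Mode"], float(info["Entitled Capacity"]))
print(info["Shared Pool ID"])   # '-' in the output, so None here
```

A monitoring script could use the Mode field to decide whether the donation columns of lparstat and mpstat are worth collecting.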
IBM Global Services
Dedicated idle cycles donation - lparstat details
•New -d flag shows more details: %idon and %bdon, the percentages of idle and busy time donated
•sar
• automatically displays physc when donation is enabled
•mpstat
• automatically displays pc and lcs if donation is enabled
• new -h option to show more details on hypervisor-related statistics
►donation enabled
System configuration: lcpu=2 mode=Donating
cpu   pc    ilcs        vlcs  idon  bdon  istol  bstol
0    0.3   50327   687231635  10.2   4.5   0.59   0.32
1    0.5   61702   684989764  10.2   4.5   0.59   0.32
ALL  0.8  112029  1372221399  20.4   9.0   1.18   0.64
►shared partition
System configuration: lcpu=2 ent=0.5 mode=Uncapped
cpu   pc    ilcs        vlcs
0    0.6  503727   687231635
1    0.6   61702   684989764
ALL  0.8  565429  1372221399
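In the donation-enabled sample, the ALL row is the column-wise sum of the per-CPU rows. The sketch below checks this for the values printed on the slide; it is an illustration with hypothetical function names, and whether ALL is always a plain sum for every column is not stated in the slides.

```python
SAMPLE = """\
cpu   pc    ilcs        vlcs  idon  bdon  istol  bstol
0    0.3   50327   687231635  10.2   4.5   0.59   0.32
1    0.5   61702   684989764  10.2   4.5   0.59   0.32
ALL  0.8  112029  1372221399  20.4   9.0   1.18   0.64
"""

def parse(text):
    """Return (column names, {row label: {column: value}})."""
    lines = text.strip().splitlines()
    header = lines[0].split()[1:]          # drop the leading 'cpu' label
    rows = {}
    for line in lines[1:]:
        tokens = line.split()
        rows[tokens[0]] = dict(zip(header, map(float, tokens[1:])))
    return header, rows

def all_row_matches(text, tol=0.01):
    """True if every ALL value equals the sum of the per-CPU values."""
    header, rows = parse(text)
    total = rows.pop("ALL")
    for col in header:
        summed = sum(row[col] for row in rows.values())
        if abs(summed - total[col]) > tol:
            return False
    return True

print(all_row_matches(SAMPLE))
```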
•Topasout report
Report: System Detailed --- hostname: ptoolsl1 version: 1.2
Start:12/21/05 10.00.00 Stop:12/21/05 11.00.00 Int: 5 Min Range: 60 Min
Time: 10.00.00 --------------------------------------------------------------
CPU          UTIL        MEMORY       PAGING       EVENTS/QUEUES  NFS
Kern  12.0   PhyB  0.7   Sz,GB 16.0   Sz,GB  4.0   Cswth   3213   SrvV2 32
User   8.0   Ent   0.0   InU    4.3   InU    2.3   Syscl  43831   CltV2 12
Wait   0.0   EntC  0.0   %Comp  3.1   Flt    221   RunQ       1   SrvV3 44
Idle  78.0   bdon  0.1   %NonC  9.0   Pg-I    87   WtQ        0   CltV3 18
SMT   ON     idon  1.0   %Clnt  2.0   Pg-O    44   VCSW    1214
LP    4      bstl  0.5
Mode  Don    istl  0.0
Note: version 1.2 is a new version number to mark the new format.
Thanks
Barcelona 2006