You are on page 1of 7

 

DIMM replacement on X400 node Suscribirse 


Visto 311 veces

tstepn...@axs.tv 27 jul 2020, 21:03:13   


a Isilon Technical User Group

Got an alert: HW_CTO_PHYS_MEM: Physical Memory low (expected 48GB, found 40GB)

Found this doc: Isilon DIMM replacement policy for Isilon nodes - Event ID: 900010007,
900160004
Article Number: 471888

Running 8.0.0.4 code on Gen4 X400 node

Got on affected node. So it does only see 40GB:


node-8# isi_hw_status | grep RAM
RAM: 42905305088 Bytes
node-8# isi_dmilog
Log:
Totals:

But since the log has no info, how do I know which DIMM needs replacing?

Thanks,
Tom

Anurag Chandra 27 jul 2020, 21:18:52   


a isilon-u...@googlegroups.com

/var/log/isi_hwmon.log on that node should have some info

On Tue, 28 Jul 2020 at 12:33 AM, tstepn...@axs.tv <tstepn...@axs.tv> wrote:


Got an alert: HW_CTO_PHYS_MEM: Physical Memory low (expected 48GB, found 40GB)

Found this doc: Isilon DIMM replacement policy for Isilon nodes - Event ID: 900010007,
900160004
Article Number: 4718g
Running 8.0.0.4 code on Gen4 X400 node

Got on affected node. So it does only see 40GB:


node-8# isi_hw_status | grep RAM
RAM: 42905305088 Bytes
node-8# isi_dmilog
Log:
Totals:

But since the log has no info, how do I know which DIMM needs replacing?

Thanks,
Tom

--
You received this message because you are subscribed to the Google Groups "Isilon Technical
User Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to isilon-user-
gr...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/isilon-user-
group/eb45ad28-1499-4043-b53a-e7bca0ee02d0n%40googlegroups.com.

mandar kolhe 27 jul 2020, 21:20:13   


a isilon-u...@googlegroups.com

Can you please share output of below command:

# dimidecode -t memory

Also just to be sure you are checking dmilog on right node ?

Their is bug in gen4 & gen5 node for dimm issues so in that nodes messages are you seeing any
MCA errors ?

Thanks,
Mandar



--
You received this message because you are subscribed to the Google Groups "Isilon Technical
User Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to isilon-user-
group+unsubscribe@googlegroups.com.

tstepn...@axs.tv 29 jul 2020, 19:24:39   


a Isilon Technical User Group

Yes, I'm on the right node. Output below:

node-8# cat /var/log/isi_hwmon.log


Feb 11 21:37:31 newsyslog[977]: logfile first created
node-8# dmidecode -t memory
# dmidecode 2.10
SMBIOS 2.6 present.

Handle 0x0038, DMI type 16, 15 bytes


Physical Memory Array
Location: System Board Or Motherboard
Use: System Memory
Error Correction Type: Multi-bit ECC
Maximum Capacity: 192 GB
Error Information Handle: Not Provided
Number Of Devices: 6

Handle 0x003A, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0038
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: P1-DIMM1A
Bank Locator: BANK0
Type: <OUT OF SPEC>
Type Detail: Other
Speed: 1333 MHz
Manufacturer: Hyundai
Serial Number: 2BA63451
Asset Tag: AssetTagNum0
Part Number: HMT41GR7AFR8A-PB
Rank: Unknown

Handle 0x003C, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0038
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: No Module Installed
Form Factor: DIMM
Set: None
Locator: P1-DIMM1B
Bank Locator: BANK1
Type: <OUT OF SPEC>
Type Detail: Other
Speed: Unknown
Manufacturer: Manufacturer01
Serial Number: SerNum01
Asset Tag: AssetTagNum1
Part Number: ModulePartNumber01
Rank: Unknown

Handle 0x003E, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0038
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: P1-DIMM2A
Bank Locator: BANK2
Type: <OUT OF SPEC>
Type Detail: Other
Speed: 1333 MHz
Manufacturer: Hyundai
Serial Number: BEA63451
Asset Tag: AssetTagNum2
Part Number: HMT41GR7AFR8A-PB
Rank: Unknown

Handle 0x0040, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0038
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: No Module Installed
Form Factor: DIMM
Set: None
Locator: P1-DIMM2B
Bank Locator: BANK3
Type: <OUT OF SPEC>
Type Detail: Other
Speed: Unknown
Manufacturer: Manufacturer03
Serial Number: SerNum03
Asset Tag: AssetTagNum3
Part Number: ModulePartNumber03
Rank: Unknown

Handle 0x0042, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0038
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: P1-DIMM3A
Bank Locator: BANK4
Type: <OUT OF SPEC>
Type Detail: Other
Speed: 1333 MHz
Manufacturer: Hyundai
Serial Number: 0AA63451
Asset Tag: AssetTagNum4
Part Number: HMT41GR7AFR8A-PB
Rank: Unknown

Handle 0x0044, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0038
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: No Module Installed
Form Factor: DIMM
Set: None
Locator: P1-DIMM3B
Bank Locator: BANK5
Type: <OUT OF SPEC>
Type Detail: Other
Speed: Unknown
Manufacturer: Manufacturer05
Serial Number: SerNum05
Asset Tag: AssetTagNum5
Part Number: ModulePartNumber05
Rank: Unknown

Handle 0x0046, DMI type 16, 15 bytes


Physical Memory Array
Location: System Board Or Motherboard
Use: System Memory
Error Correction Type: Multi-bit ECC
Maximum Capacity: 192 GB
Error Information Handle: Not Provided
Number Of Devices: 6

Handle 0x0048, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0046
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: No Module Installed
Form Factor: DIMM
Set: None
Locator: P2-DIMM1A
Bank Locator: BANK6
Type: <OUT OF SPEC>
Type Detail: Other
Speed: Unknown
Manufacturer: Manufacturer06
Serial Number: SerNum06
Asset Tag: AssetTagNum6
Part Number: ModulePartNumber06
Rank: Unknown

Handle 0x004A, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0046
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: No Module Installed
Form Factor: DIMM
Set: None
Locator: P2-DIMM1B
Bank Locator: BANK7
Type: <OUT OF SPEC>
Type Detail: Other
Speed: Unknown
Manufacturer: Manufacturer07
Serial Number: SerNum07
Asset Tag: AssetTagNum7
Part Number: ModulePartNumber07
Rank: Unknown

Handle 0x004C, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0046
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: P2-DIMM2A
Bank Locator: BANK8
Type: <OUT OF SPEC>
Type Detail: Other
Speed: 1333 MHz
Manufacturer: Hyundai
Serial Number: 2AA73451
Asset Tag: AssetTagNum8
Part Number: HMT41GR7AFR8A-PB
Rank: Unknown

Handle 0x004E, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0046
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: No Module Installed
Form Factor: DIMM
Set: None
Locator: P2-DIMM2B
Bank Locator: BANK9
Type: <OUT OF SPEC>
Type Detail: Other
Speed: Unknown
Manufacturer: Manufacturer09
Serial Number: SerNum09
Asset Tag: AssetTagNum9
Part Number: ModulePartNumber09
Rank: Unknown

Handle 0x0050, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0046
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: 8192 MB
Form Factor: DIMM
Set: None
Locator: P2-DIMM3A
Bank Locator: BANK10
Type: <OUT OF SPEC>
Type Detail: Other
Speed: 1333 MHz
Manufacturer: Hyundai
Serial Number: 04A73451
Asset Tag: AssetTagNum10
Part Number: HMT41GR7AFR8A-PB
Rank: Unknown

Handle 0x0052, DMI type 17, 28 bytes


Memory Device
Array Handle: 0x0046
Error Information Handle: Not Provided
Total Width: 72 bits
Data Width: 64 bits
Size: No Module Installed
Form Factor: DIMM
Set: None
Locator: P2-DIMM3B
Bank Locator: BANK11
Type: <OUT OF SPEC>
Type Detail: Other
Speed: Unknown
Manufacturer: Manufacturer11
Serial Number: SerNum11
Asset Tag: AssetTagNum11
Part Number: ModulePartNumber11
Rank: Unknown

I can now identify the Location / Bank Location info for each module. Output in bold is missing info
("not like the others"), so I'm assuming that the slot is either empty, or this is the DIMM that needs
replacement. ..?

On Monday, July 27, 2020 at 1:20:13 PM UTC-6 kolhem...@gmail.com wrote:


Can you please share output of below command:

# dimidecode -t memory

Also just to be sure you are checking dmilog on right node ?

Their is bug in gen4 & gen5 node for dimm issues so in that nodes messages are you seeing
any MCA errors ?

Thanks,
Mandar

On Jul 28, 2020 12:33 AM, "tstepn...@axs.tv" <tstepn...@axs.tv> wrote:


Got an alert: HW_CTO_PHYS_MEM: Physical Memory low (expected 48GB, found 40GB)

Found this doc: Isilon DIMM replacement policy for Isilon nodes - Event ID: 900010007,
900160004
Article Number: 471888
Running 8.0.0.4 code on Gen4 X400 node

Got on affected node. So it does only see 40GB:


node-8# isi_hw_status | grep RAM
RAM: 42905305088 Bytes
node-8# isi_dmilog
Log:
Totals:

But since the log has no info, how do I know which DIMM needs replacing?

Thanks,
Tom

--
You received this message because you are subscribed to the Google Groups "Isilon
Technical User Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to isilon-
user-gr...@googlegroups.com.

You might also like