2522, 12:52.AM Document Display
Copyright (c) 2022, Oracle. All rights reserved. Oracle Confidential.
OLVM: How OLVM Monitor Storage Health (Doc ID 2705707.1)
In this Document
Goal
Solution
Monitor Storage Pool Manager Health
Monitor Storage Domain Health
References
Linux OS - Version Oracle Linux 7.5 with Unbreakable Enterprise Kernel [4.14.35] and later
Linux x86-64
This article explains in detail how OLVM monitors storage health.
Monitor Storage Pool Manager Health
The Storage Pool Manager (SPM) is a management role assigned to one of the hosts in a data center, enabling it to manage
the storage domains of the data center. OLVM checks SPM availability and metadata integrity at every SPM polling interval
(10 seconds).
You can check the related engine configuration as follows. Changing the parameters below is not recommended unless a
Support engineer advises it for a specific use case:
$ engine-config -g StoragePoolRefreshTimeInSeconds
StoragePoolRefreshTimeInSeconds: 10 version: general
$ engine-config -g SpmCommandFailOverRetries
SpmCommandFailOverRetries: ... version: general
$ engine-config -g SPMFailOverAttempts
SPMFailOverAttempts: 3 version: general
$ engine-config -g DelayResetForSpmInSeconds
DelayResetForSpmInSeconds: 20 version: general
For an explanation of the engine-config options, refer to Doc ID 2696009.1.
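When several of these options need to be reviewed together, it can help to collect the `engine-config -g` output into a dictionary. The following is a minimal sketch, assuming the output has the "KEY: VALUE version: VERSION" shape shown above. The function name `parse_engine_config` and the sample text are hypothetical and used only for illustration.

```python
import re

# One line of `engine-config -g KEY` output, for example:
#   DelayResetForSpmInSeconds: 20 version: general
# The line shape is an assumption based on the output shown in this article.
_LINE = re.compile(
    r"^(?P<key>\w+):\s*(?P<value>.*?)\s+version:\s*(?P<version>\S+)\s*$"
)


def parse_engine_config(output):
    """Return {key: (value, version)} for every line that matches."""
    parsed = {}
    for line in output.splitlines():
        match = _LINE.match(line.strip())
        if match:
            parsed[match["key"]] = (match["value"], match["version"])
    return parsed


# Sample text copied from the outputs above (not read from a live engine).
sample = """\
SPMFailOverAttempts: 3 version: general
DelayResetForSpmInSeconds: 20 version: general
"""

settings = parse_engine_config(sample)
print(settings)
```

In practice the text would come from running `engine-config -g` on the engine host. The sketch only reads values and never changes them.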
Monitor Storage Domain Health
Storage domain health is monitored by both the OLVM engine and Vdsm on the KVM host.
1. OLVM monitors storage domain:
By default, OLVM polls each host's status every 3 seconds, and each poll also checks the storage status. If getRepoStats
reports a non-zero code, or if lastcheck (statsGenTime - domStatus.checkTime) is higher than
MaxStorageVdsTimeoutCheckSec (30 seconds by default), the storage domain is marked as problematic and a timer starts for the
domain. The storage domain failure timeout is StorageDomainFailureTimeoutInMinutes (5 minutes by default). If the problematic
storage domain does not recover within this time, the host is set to Non-Operational. If the problematic domain recovers
within this time, OLVM automatically activates the KVM host again.
Here is the related engine configuration:
$ engine-config -g VdsRefreshRate
VdsRefreshRate: 3 version: general
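The engine-side decision described above can be sketched as a small state machine. This is an illustration under stated assumptions, not the engine's actual implementation: the class `DomainMonitor`, the function `is_problematic`, and the returned status strings are hypothetical names. Only the two thresholds and the non-zero code / lastcheck rule come from this article.

```python
# Defaults quoted in this article.
MAX_STORAGE_VDS_TIMEOUT_CHECK_SEC = 30      # MaxStorageVdsTimeoutCheckSec
STORAGE_DOMAIN_FAILURE_TIMEOUT_MIN = 5      # StorageDomainFailureTimeoutInMinutes


def is_problematic(code, lastcheck_sec):
    """A domain is problematic if the reported code is non-zero or the
    last successful check is older than the allowed timeout."""
    return code != 0 or lastcheck_sec > MAX_STORAGE_VDS_TIMEOUT_CHECK_SEC


class DomainMonitor:
    """Hypothetical per-domain tracker, fed once per host poll."""

    def __init__(self):
        self.problem_since = None  # time (seconds) when the timer started

    def update(self, now_sec, code, lastcheck_sec):
        if not is_problematic(code, lastcheck_sec):
            # Domain recovered: clear the timer.
            self.problem_since = None
            return "ok"
        if self.problem_since is None:
            # First problematic report: start the failure timer.
            self.problem_since = now_sec
        elapsed = now_sec - self.problem_since
        if elapsed >= STORAGE_DOMAIN_FAILURE_TIMEOUT_MIN * 60:
            return "non-operational"
        return "problematic"


monitor = DomainMonitor()
print(monitor.update(0, 0, 1.0))     # healthy report
print(monitor.update(3, 0, 45.0))    # lastcheck above 30 s, timer starts
print(monitor.update(303, 0, 45.0))  # still failing 5 minutes later
print(monitor.update(306, 0, 2.0))   # domain recovered
```

The sketch keeps one timer per domain and resets it on the first healthy report, which mirrors the recovery behavior described in the text.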
2. KVM host monitors storage health:
If a KVM host cannot access the storage domains, it becomes Non-Operational. Vdsm refreshes storage status at the
sd_health_check_delay monitoring interval. It invokes getStorageDomainStats to get the domain status (dom.getStats) and
returns that status to OLVM via repoStats in _getDomsStats. Both repo_stats_cache_refresh_timeout and sd_health_check_delay
can be configured in /etc/vdsm/vdsm.conf.
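To check which values a host is using, the two options can be read with a standard INI parser. The following is a minimal sketch with several assumptions: the options are assumed to live in an `[irs]` section, the numbers in the sample text are illustrative rather than recommendations, and `read_monitor_settings` is a hypothetical helper name.

```python
import configparser

# Illustrative file content. The [irs] section name and both values are
# assumptions for this sketch, not taken from this article.
SAMPLE_VDSM_CONF = """\
[irs]
sd_health_check_delay = 10
repo_stats_cache_refresh_timeout = 300
"""

OPTION_NAMES = ("sd_health_check_delay", "repo_stats_cache_refresh_timeout")


def read_monitor_settings(text, section="irs"):
    """Return the monitoring options that are explicitly set in the text.

    Options that are absent are omitted, since Vdsm would then fall back
    to its built-in defaults.
    """
    parser = configparser.ConfigParser()
    parser.read_string(text)
    return {
        name: parser.getint(section, name)
        for name in OPTION_NAMES
        if parser.has_option(section, name)
    }


monitor_settings = read_monitor_settings(SAMPLE_VDSM_CONF)
print(monitor_settings)
```

On a real host the text would be read from /etc/vdsm/vdsm.conf. An empty result means neither option is overridden in that file.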