Skip to main content

Portworx CSI Alerts

Portworx CSI has a predefined set of alerts which are listed below. These alerts are critical for analysing the health and performance of your storage infrastructure. Each alert is categorized based on its type and severity, enabling efficient troubleshooting and management.

The table below outlines the key details:

  • Name: The identifier for the alert.
  • AlertType: The valid values for AlertType are: volume, node, cluster, drive, and pool, each representing a specific category of infrastructure components monitored for potential issues.
  • Severity: The level of importance, where "ALARM" indicates critical issues requiring immediate attention.

List of Portworx CSI Alerts

NameResourceTypeSeverityDescriptionMetric
MeteringAgentCriticalCLUSTERALARMTriggered when the metering agent encounters a critical problem.px_alerts_meteringagentcritical_total
LicenseExpiredCLUSTERALARMTriggered when the cluster license expires.px_alerts_licenseexpired_total
LicenseLeaseExpiredCLUSTERALARMTriggered when the license lease has expired since the last lease refresh failed.px_alerts_licenseleaseexpired_total
BaseAgentRegistrationFailedCLUSTERALARMBasge agent failed to registerpx_alerts_baseagentregistrationfailed_total
LicenseExpiringCLUSTERWARNINGWarning triggers 7 days before the license will expire. It will also keep triggering after the license has expired (e.g. “Trial license expired 4 days, 06:22 ago”).px_alerts_licenseexpiring_total
MeteringAgentWarningCLUSTERWARNINGTriggered when the metering agent encounters a non-critical problem.px_alerts_meteringagentwarning_total
LicenseLeaseExpiringCLUSTERWARNINGTriggered when the license lease is about to expire since the last lease refresh failed.px_alerts_licenseleaseexpiring_total
ClusterLicenseUpdatedCLUSTERNOTIFYTriggered when a license is updated for a cluster.px_alerts_clusterlicenseupdated_total
NodeStartFailureNODEALARMTriggered when a node in the Portworx cluster fails to start.px_alerts_nodestartfailure_total
NodeStateChangeNODEALARMNode state changed (i.e. it went down, came online etc.)px_alerts_nodestatechange_total
PXInitFailureNODEALARMTriggered when Portworx fails to initialize on a node.px_alerts_pxinitfailure_total
ClusterManagerFailureNODEALARMTriggered when Cluster manager on a Portworx node fails to start. The alert message will give more info about the specific error case.px_alerts_clustermanagerfailure_total
NodeDecommissionFailureNODEALARMTriggered when a node could not be decommissioned from Portworx cluster.px_alerts_nodedecommissionfailure_total
NodeInitFailureNODEALARMTriggered when Portworx fails to initialize on a node.px_alerts_nodeinitfailure_total
LicenseCheckFailedNODEALARMTriggered if a node fails a license check.px_alerts_licensecheckfailed_total
KvdbConnectionFailedNODEALARMTriggered if Portworx fails to connect to the KVDB.px_alerts_kvdbconnectionfailed_total
InternalKvdbSetupFailedNODEALARMTriggered if Portworx fails to setup Internal KVDB on a node.px_alerts_internalkvdbsetupfailed_total
PortworxMonitorImagePullFailedNODEALARMTriggered if Portworx fails to pull Portworx images during installation.px_alerts_portworxmonitorimagepullfailed_total
PortworxMonitorPrePostExecutionFailedNODEALARMTriggered if Portworx fails to execute pre or post installation tasks.px_alerts_portworxmonitorprepostexecutionfailed_total
PortworxMonitorMountValidationFailedNODEALARMTriggered if Portworx fails to validate mounts provided to Portworx container during installation.px_alerts_portworxmonitormountvalidationfailed_total
PortworxMonitorSchedulerInitializationFailedNODEALARMTriggered if Portworx fails to initialize connection with scheduler during installation.px_alerts_portworxmonitorschedulerinitializationfailed_total
PortworxMonitorServiceControlsInitializationFailedNODEALARMTriggered if Portworx fails to initialize the service controls during installation.px_alerts_portworxmonitorservicecontrolsinitializationfailed_total
PortworxMonitorInstallFailedNODEALARMTriggered if Portworx installation fails.px_alerts_portworxmonitorinstallfailed_total
MissingInputArgumentNODEALARMTriggered if there’s a missing input install argument.px_alerts_missinginputargument_total
InvalidArgumentNODEALARMInvalid input argumentpx_alerts_invalidargument_total
PXHostDependencyFailureNODEALARMHost does not meet dependencies for applied px configurationpx_alerts_pxhostdependencyfailure_total
CallHomeFailureNODEALARMCall home failurepx_alerts_callhomefailure_total
DiagCollectJobCancelledNODEALARMDiagCollect job cancelledpx_alerts_diagcollectjobcancelled_total
DiagCollectJobFailedNODEALARMDiagCollect job failedpx_alerts_diagcollectjobfailed_total
PXNodePrerequisiteMissingNODEALARMTriggered when Portworx is missing a prerequisite to startpx_alerts_pxnodeprerequisitemissing_total
ArrayLoginFailedNODEALARMTriggered when to login to FlashArray failspx_alerts_arrayloginfailed_total
MountpointCleanupFailedNODEALARMTriggered when mountpoint cleaner failspx_alerts_mountpointcleanupfailed_total
PXStateChangeNODEWARNINGTriggered when the Portworx daemon shuts down in error.px_alerts_pxstatechange_total
NodeDecommissionPendingNODEWARNINGTriggered when a node decommission is kept in pending state as it has data which is not replicated on other nodes.px_alerts_nodedecommissionpending_total
NodeMarkedDownNODEWARNINGTriggered when a Portworx node marks another node down as it is unable to connect to it.px_alerts_nodemarkeddown_total
SecretsAuthFailedNODEWARNINGSecrets setup has failedpx_alerts_secretsauthfailed_total
PortworxStoppedOnNodeNODEWARNINGTriggered if Portworx is stopped on a node.px_alerts_portworxstoppedonnode_total
KvdbConnectionWarningNODEWARNINGkvdb endpoint is not accessiblepx_alerts_kvdbconnectionwarning_total
NodeStartCannotProceedNODEWARNINGTriggered when Portworx startup on a node cannot proceed because a dependency has not been metpx_alerts_nodestartcannotproceed_total
NodeStartSuccessNODENOTIFYTriggered when a node in the Portworx cluster successfully initializes.px_alerts_nodestartsuccess_total
PXInitSuccessNODENOTIFYTriggered when Portworx successfully initializes on a node.px_alerts_pxinitsuccess_total
NodeDecommissionSuccessNODENOTIFYTriggered when a node is successfully decommissioned from Portworx cluster.px_alerts_nodedecommissionsuccess_total
PXReadyNODENOTIFYTriggered when Portworx is ready on a node.px_alerts_pxready_total
PortworxMonitorImagePullInProgressNODENOTIFYTriggered when Portworx is pulling and extracting images during installation or upgrade.px_alerts_portworxmonitorimagepullinprogress_total
DiagCollectJobStartedNODENOTIFYDiagCollect job started executionpx_alerts_diagcollectjobstarted_total
DiagCollectJobInProgressNODENOTIFYDiagCollect job in progresspx_alerts_diagcollectjobinprogress_total
DiagCollectJobFinishedNODENOTIFYDiagCollect job finished executionpx_alerts_diagcollectjobfinished_total
CCMstatusFailedNODENOTIFYCCM status check failedpx_alerts_ccmstatusfailed_total
CCMuploadFailedNODENOTIFYUpload to CCM failedpx_alerts_ccmuploadfailed_total
ROVolPodBounceNODENOTIFYTriggered when read-write (rw) volume mounts turn read-only (ro) due to errors. Application pods using them will be bounced.px_alerts_rovolpodbounce_total
KvdbEndpointsChangedNODENOTIFYTriggered when this node starts using a different set of kvdb endpoints.px_alerts_kvdbendpointschanged_total
KvdbBootstrapEntryAddedNODENOTIFYTriggered when this node adds an entry (usually for this node) to the internal KVDB bootstrap database.px_alerts_kvdbbootstrapentryadded_total
KvdbBootstrapEntryRemovedNODENOTIFYTriggered when this node removes an entry (for this or another node) from the internal KVDB bootstrap database.px_alerts_kvdbbootstrapentryremoved_total
KvdbMemberAddedNODENOTIFYTriggered when this node adds itself as a member to the internal KVDB cluster.px_alerts_kvdbmemberadded_total
KvdbMemberRemovedNODENOTIFYTriggered when this node removes itself or another node from the internal KVDB cluster.px_alerts_kvdbmemberremoved_total