Keep your Portworx Deployment Up and Running
You can maintain a reliable, high-performing Portworx environment by following operational best practices for monitoring, scaling, protecting, and optimizing your cluster throughout its lifecycle. The following day-2 operations help you maintain the performance, availability, and resilience of your cluster:
- Observe your Portworx cluster using Prometheus, Grafana, and Portworx dashboards. Track storage and performance metrics such as health, latency, IOPS, and throughput to maintain visibility and detect issues early.
- Create snapshots of your volumes on demand or on a schedule. You can clone these snapshots to PVCs and use them with your applications.
- Provision and manage storage pools built from local disks or cloud drives.
- Scale your cluster automatically or manually using Autopilot. Adjust capacity and storage pool size based on usage thresholds and operational needs.
- Migrate workloads between Kubernetes clusters with Stork. Use ClusterPair objects and migration schedules to move volumes and applications safely and consistently.
- Set up disaster recovery using synchronous (Metro DR) or asynchronous methods.
- Tune performance by configuring a dedicated data interface, enabling LACP bonding, adjusting runtime parameters, and optimizing topology, StorageClasses, and volume placement.
- Troubleshoot your environment by collecting diagnostics, resolving common errors (including vSphere-specific issues), using systemd integration, and enabling telemetry with Pure1.
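
To make the idea of adjusting storage pool size based on usage thresholds concrete, the following is a minimal sketch in plain Python. It is not Autopilot's API or rule syntax: the function name `plan_pool_expansion`, its parameters, and the default values (70% threshold, 50% growth) are hypothetical and chosen only for illustration.

```python
from typing import Optional


def plan_pool_expansion(
    used_gib: float,
    capacity_gib: float,
    threshold_pct: float = 70.0,      # hypothetical default, not a Portworx setting
    scale_pct: float = 50.0,          # hypothetical default, not a Portworx setting
    max_capacity_gib: Optional[float] = None,
) -> float:
    """Return the target pool capacity in GiB for a threshold-based policy.

    If usage is at or below the threshold, the current capacity is returned
    unchanged. Otherwise capacity grows by scale_pct, capped at
    max_capacity_gib when a cap is given.
    """
    if capacity_gib <= 0:
        raise ValueError("capacity_gib must be positive")

    usage_pct = 100.0 * used_gib / capacity_gib
    if usage_pct <= threshold_pct:
        return capacity_gib

    new_capacity = capacity_gib * (1.0 + scale_pct / 100.0)
    if max_capacity_gib is not None:
        new_capacity = min(new_capacity, max_capacity_gib)

    # Never shrink the pool, even if the cap is below the current size.
    return max(new_capacity, capacity_gib)


if __name__ == "__main__":
    # A 100 GiB pool at 80% usage crosses the 70% threshold and grows by 50%.
    print(plan_pool_expansion(used_gib=80, capacity_gib=100))
    # The same pool with a 120 GiB cap stops at the cap.
    print(plan_pool_expansion(used_gib=80, capacity_gib=100, max_capacity_gib=120))
```

The cap reflects the operational side of the decision: an automatic policy usually needs an upper bound so that a runaway workload cannot grow a pool without limit.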