Show how to autoscale GPU resources on OpenShift in AWS using the following...
Objects:
MachineSetMachineSetAutoScalerClusterAutoscaler
Operators:
- OpenShift 4.16+
- role:
cluster-admin
- role:
Red Hat Demo Platform - Options (Tested) Below
NOTE: The node sizes below are the recommended minimum to select for provisioning
- AWS with OpenShift Open Environment
- 1 x Control Plane -
m6a.2xlarge - 0 x Workers -
m6a.2xlarge
- 1 x Control Plane -
- One Node OpenShift
- 1 x Control Plane -
m6a.2xlarge
- 1 x Control Plane -
- OpenShift GitOps Blank Environment
- 1 x Control Plane -
16 cores,64Gi
- 1 x Control Plane -
Install the OpenShift Web Terminal
The following icon should appear in the top right of the OpenShift web console after you have installed the operator. Clicking this icon launches the web terminal.
NOTE: Reload the page in your browser if you do not see the icon after installing the operator.
# apply the enhanced web terminal
oc apply -k https://github.com/redhat-na-ssa/demo-ocp-autoscale-gpu/demo/web-terminal
# delete old web terminal
$(wtoctl | grep 'oc delete')NOTE: The following commands are available in an enhanced web terminal.
Run directly from remote
until oc apply -k https://github.com/redhat-na-ssa/demo-ocp-autoscale-gpu/demo; do : ; doneQuery the status of the cluster autoscaler
oc -n openshift-machine-api \
describe cm cluster-autoscaler-statusoc delete -k demo/uninstallReset the web terminal
oc apply -k components/operators/web-terminal/reset-defaultNOTE: If you have manually installed operators through the OpenShift web console before running this automation there may be duplicate operator groups.
Manually Remove any duplicate operator groups ending in -xxxxx or use the following command in the enhanced web terminal.
ocp_fix_duplicate_operator_groups