InstaScale is a controller that works with Multi-cluster-app-dispatcher (MCAD) to get aggregated resources available in the kubernetes cluster without creating pending pods. It uses machinesets to launch instances on cloud provider to be added to the Kubernetes cluster.
Key features:
- Acquires aggregated heterogenous instances needed for workload execution.
- Does not clog Kubernetes control plane.
- Works with your Kubernetes scheduling system to schedule pods on aggregated resources.
- Terminates instances on workload completion.
- User submits Multi GPU job(s)
- Job(s) lands in MCAD queue
- When resources are not available it triggers scaling i.e. calls InstaScale
- InstaScale looks at resource requests specified by the user and matches those with the desired Machineset(s) to get nodes.
- After InstaScal-ing, when aggregate resources are available to run the job MCAD dispatches the job.
- When job completes, resources obtained for the job are released.
- Installed Go version 1.19
- Running OpenShift cluster
- To build locally :
make build
- To run locally :
make run
- To build and release a docker image for controller :
make IMG=quay.io/project-codeflare/instascale:<TAG> image-build image-push
- Note that the other contents of the Makefile (as well as the
config
andbin
dirs) exist for future operator development, and are not currently utilized
- Deploy InstaScale using:
make deploy
-
Deploy MCAD using steps here.
-
In Visual Studio Code update
.vscode/launch.json
so that"KUBECONFIG"
points to your Kubernetes config file. -
If you changed the namespace in
config/default/kustomization.yaml
update theargs[]
inlaunch.json
to include"--configs-namespace=<YOUR_NAMESPACE>", "--ocm-secret-namespace=<YOUR_NAMESPACE>"
. -
You can now run the local deployment with the debugger.
Running InstaScale locally to an OSD cluster requires extra steps from the above.
- Add the
instascale-ocm-secret
- Get your API token from here
- Navigate to Workloads -> secrets
- Select your project to
instascale-system
- Click Create -> Key/value secret
- Secret name:
instascale-ocm-secret
- Key:
token
- Value:
<YOUR_API_TOKEN>
- Click Create
Run tests with command:
go test -v ./controllers/
Prerequisite:
- Build and release MCAD
- Make sure that MCAD version is published on Go package site
- Run instascale-release.yml action.
- Verify that instascale-release.yml action passed successfully.