This scenario requires, at minimum:
- OpenShift Cluster 4.17+
- 8 GPUs free (A100 or H100)
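One way to sanity-check these prerequisites before deploying is sketched below. It assumes you are already logged in with `oc` and that GPUs are advertised under the `nvidia.com/gpu` resource name (the usual name when the NVIDIA GPU Operator is installed). `sum_gpus` is a hypothetical helper introduced only for this example. Note that the figure is allocatable capacity per node, not the number of GPUs currently unused, so treat it as an upper bound.

```shell
# Hypothetical helper: sum one GPU count per input line (blank lines count as 0).
sum_gpus() {
  awk '{ total += $1 } END { print total + 0 }'
}

# Only talk to the cluster if the oc client is installed.
if command -v oc >/dev/null 2>&1; then
  # Client and server versions; the server should report 4.17 or later.
  oc version

  # Allocatable GPUs per node, summed across the cluster.
  # Assumes the resource name nvidia.com/gpu; nodes without GPUs print an empty line.
  oc get nodes \
    -o jsonpath='{range .items[*]}{.status.allocatable.nvidia\.com/gpu}{"\n"}{end}' \
    | sum_gpus
fi
```

If the printed total is below 8, the scenario will not fit on the cluster as described.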
A kustomization.yaml file exists to launch all the Kubernetes objects required for the scenarios defined in the repository. To create them, run the following:
oc new-project llama-serve
oc apply -k kubernetes
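To see what the commands above create, and whether it came up, something like the following may help. This is a rough sketch: it assumes it is run from the repository root (so that the `kubernetes` directory resolves) and that the workloads run as pods in the `llama-serve` project. The `NAMESPACE` variable is introduced here purely for convenience.

```shell
# Project created by "oc new-project" above.
NAMESPACE="llama-serve"

# Only talk to the cluster if the oc client is installed.
if command -v oc >/dev/null 2>&1; then
  # Render the manifests locally without applying them, to review what will be created.
  oc kustomize kubernetes

  # After applying, list the pods in the project and check their status.
  oc get pods -n "$NAMESPACE"
fi
```

Pods that request GPUs will typically stay in `Pending` if the cluster cannot satisfy the request.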
This project uses uv as its package manager for the Python-based notebooks and demo scripts. You can quickly set up your working environment by following these steps:
pip install uv
uv sync
source .venv/bin/activate
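A quick way to confirm that the activation took effect is to check the `VIRTUAL_ENV` variable, which the standard `activate` script exports. The sketch below wraps that check in a small function; `venv_status` is a hypothetical name used only for this example.

```shell
# Hypothetical helper: report whether a virtual environment is active in this shell.
venv_status() {
  if [ -n "${VIRTUAL_ENV:-}" ]; then
    echo "active virtual environment: $VIRTUAL_ENV"
  else
    echo "no virtual environment is active"
  fi
}

venv_status
```

After sourcing `.venv/bin/activate` from the repository root, the reported path should end in `.venv`.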
Once the virtual environment is active, you should be able to run any script in app/src.