Run ML training on a GPU VM
An end-to-end tutorial for creating an ECI GPU VM and running your first PyTorch training job.
Run LLM inference on a GPU VM
A tutorial for running a large language model on an ECI GPU VM using vLLM.
Deploy a Hugging Face model with FastAPI
Tutorial on deploying a Hugging Face model as a FastAPI server.
InfiniBand setup and benchmarking
How to set up InfiniBand on an ECI virtual cluster and measure NCCL communication performance.
Download and test a Hugging Face model
Tutorial on downloading a model from the Hugging Face Hub and running inference on an ECI VM.