8
GPU Multiplexing
+100 XP5 min8 / 13
Overview: GPU Multiplexing
Overview: GPU Multiplexing
MIG (Multi-Instance GPU) partitions the GPU in hardware — up to 7 isolated instances on an A100, each with guaranteed memory and compute. Time-slicing shares GPU in software — up to 90% cost savings but no memory isolation. MPS enables concurrent CUDA kernels for lightweight embedding jobs.
1 of 3