VRAM (Video RAM) is GPU memory. Everything during training must fit in VRAM:
- Model parameters
- Optimizer states
- Gradients
- Activations
- Input batch
Consumer GPUs have -GB. Data center GPUs have -GB. Your fine-tuning method depends heavily on available VRAM. Out of memory errors mean something doesn't fit.