QLoRA uses NF4 (NormalFloat -bit), a quantization format designed for neural network weights.
Neural network weights roughly follow a normal distribution. NF4 places quantization levels to optimally represent this distribution. More levels near zero (where most weights cluster), fewer in the tails.
NF4 loses less information than uniform -bit quantization.