They control the randomness, length, and diversity of output at generation time.
This is correct. Inference parameters are settings that adjust the behavior at generation (inference) time without changing the model's weights. They include output randomness (temperature), how candidates are narrowed (Top-P/Top-K), and length (maximum token count).