Model size
This is incorrect. Model size (number of parameters) is an indicator of accuracy and required resources. In general, larger models tend to be more accurate but slower to respond — this can actually work against the immediate-response requirement.