After customization, there is no token-based charge and only a retention cost remains.
Even with a custom model, inference incurs a charge based on tokens and provisioned capacity.
The understanding that the charge disappears is wrong, so this is incorrect.