Build and operate an inference server on EC2 independently.
This is incorrect. Building on EC2 can achieve real-time inference, but the company must build, patch, and scale the server itself. This does not meet the requirement of 'minimizing operational overhead.'