Networking for AI inference model serving on GKE