What would you like to be added:
An application load balancer that handles requests taking consideration of the current loads (CPU/MEM/GPU/...) of cluster.
Why is this needed:
To get good performance and optimal utilization of nodes.
others
/kind feature