A Fast, Scalable Gen AI Inference Platform