The Path to Achieve Ultra-Low Inference Latency With LLaMA 65B on PyTorch/XLA Background & State of the Art OpenTeams June 28, 2023