Architecture and System Co-Design for Scalable Large Language Model Inference
Speaker:Mr. GU Yufeng, Ph.D. candidate (advised by Prof. Reetuparna Das), University of Michigan ; Title:Architecture and System Co-Design for Scalable Large Language Model Inference