EVENT DETAILS
This is an in-person event. Food & drink will be provided. Join us for networking & socializing. The talk is at AMD.
Speaker: Nick Ni, Sr. Director of AI Product Management at AMD
Talk Abstract:
Large language models (LLMs) like GPT-3 have demonstrated impressive capabilities in natural language processing. However, running these massive neural networks requires significant computational resources. New data center GPUs from AMD offer powerful performance optimized for training & inference. In this presentation, we explore using an open software platform to run LLMs. Standard frameworks like PyTorch & TensorFlow are used, as well as open libraries such as vLLM. We benchmark the performance of LLMs of various sizes, such as Llama2 & Bloom. Our results demonstrate that comparable or better performance can be achieved. Key optimizations include efficiently mapping matrix multiplication & attention layers. With careful tuning, better performance & cost-effective deployment of large language models are possible for a wide array of applications.
Meetup agenda:
6-6:30 pm Check in, food & drink, networking
6:30-7:30 pm Talk by Nick Ni
7:30-8 pm Q&A & additional networking
To expedite onsite security registration, please answer these few questions so we can pre-register you: https://docs.google.com/forms/d/e/1FAIpQLSepbm4dPWgMpPTGDf5UzpbhP4RRkZDLCe4Hn1LnnXVOG5jhfA/viewform