- Deep Learning
- Efficient AI
Hoai Chau held a BSc at the University of Science Ho Chi Minh City. He is currently working as a Research Assistant at VinUniversity under the guidance of Prof. Doan Dang Khoa and Prof. Heng Ji (UIUC).
His research interest is low-resource and deep-learning model compression.
His current research focuses on techniques to compress Transformer-based models, including Large Language Models (LLMs) and Vision Transformers (ViTs). This includes approaches like quantization, token merging, KV cache compression, and developing more efficient decoding algorithms.