I am a Ph.D. student in the Machine Learning Center at Georgia Tech (ML@GT), advised by Prof. Chao Zhang. I am also fortunate to work with Prof. Le Song. My research interests lie primarily in Machine Learning and Natural Language Processing.
Before that, I obtained my bachelor's degree from Zhejiang University and spent my senior year as a visiting student researcher at Harvard Medical School.
News
- [--- Top ---] Join the Journey: I am currently diving into exciting projects on Large Language Models (LLMs) and seeking collaborators/mentees with expertise in this field! Computational resources and preliminarily validated ideas are provided. Contact me if you are interested in collaborating!
- [Dec. 2023] I will join Microsoft Research as a research intern in Spring 2024, exploring Mixture of Experts for LLMs.
- [May 2023] One paper accepted to KDD'23: we propose an iterative and adaptive boosting framework for weakly-supervised learning.
- [May 2023] Our paper PTLoss is now on arXiv: we propose a novel knowledge distillation objective that perturbs the standard KL loss.
- [May 2023] Two papers accepted to ACL'23, discussing cold-start data selection for few-shot LM fine-tuning and retrieval-enhanced LMs.
- [Mar. 2023] I will return to Google Research NYC as a student researcher this summer.
- [May 2022] One paper accepted to KDD'22, discussing adaptive multi-view rule discovery.
- [Apr. 2022] One paper accepted to NAACL'22, discussing uncertainty-based active self-training.
- [Mar. 2022] I will join Google Research as a student researcher this summer. See you in New York City!
- [Feb. 2022] One paper accepted to ACL'22, discussing interactive weakly-supervised learning.
Research
My current research revolves around large language models (LLMs) and is dedicated to more efficient and effective learning with limited supervision.
In particular, I explore novel learning paradigms in which humans and machines interact more effectively.
I also investigate learning objective design to improve models trained with noisy or limited supervision; a minimal sketch of one such objective follows the list below.
I am also interested in strategically selecting data to maximize annotation efficiency.
My research thrusts are as follows:
- Large Language Model: Mechanism, Distillation, Improvement of Training Pipeline
- Learning Paradigm: Interactive and Iterative Methods, Weakly-Supervised Learning
- Learning Objective: Loss Design for Calibration/Deviation-Reduction
- Data Selection: Active Learning, Uncertainty Estimation and Propagation
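As a concrete illustration of the learning-objective thrust, here is a minimal sketch, in PyTorch, of a distillation loss that perturbs the teacher distribution before computing the KL term, in the spirit of the PTLoss idea mentioned in the News section. The perturbation scheme (dampening the teacher's log-probabilities and renormalizing) and the function name are illustrative assumptions, not the exact formulation from the paper.

```python
import torch
import torch.nn.functional as F

def perturbed_kl_distillation_loss(student_logits, teacher_logits,
                                   temperature=2.0, perturb_scale=0.1):
    """Hypothetical perturbed-KL distillation objective (a sketch, not PTLoss itself)."""
    # Temperature-softened log-distributions for teacher and student.
    t_log_probs = F.log_softmax(teacher_logits / temperature, dim=-1)
    s_log_probs = F.log_softmax(student_logits / temperature, dim=-1)

    # Illustrative perturbation: dampen the teacher's log-probabilities so the
    # student does not blindly imitate a possibly miscalibrated teacher, then
    # renormalize into a valid probability distribution.
    perturbed_teacher = F.softmax(t_log_probs * (1.0 - perturb_scale), dim=-1)

    # KL(perturbed teacher || student), scaled by T^2 as in standard distillation.
    return F.kl_div(s_log_probs, perturbed_teacher,
                    reduction="batchmean") * temperature ** 2

# Example usage with random logits: a batch of 4 examples, 10 classes.
student_logits = torch.randn(4, 10)
teacher_logits = torch.randn(4, 10)
loss = perturbed_kl_distillation_loss(student_logits, teacher_logits)
```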
Publications
Rongzhi Zhang, Jiaming Shen, Tianqi Liu, Jialu Liu, Michael Bendersky, Marc Najork, and Chao Zhang.
Do Not Blindly Imitate the Teacher: Using Perturbed Loss for Knowledge Distillation
Preprint on arXiv, 2023.
[BibTeX] [arXiv]
Rongzhi Zhang, Yue Yu, Jiaming Shen, Xiquan Cui, and Chao Zhang.
Local Boosting for Weakly-Supervised Learning
In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2023.
[BibTeX] [arXiv]
Yue Yu, Rongzhi Zhang, Ran Xu, Jieyu Zhang, Jiaming Shen, and Chao Zhang.
Cold-Start Data Selection for Few-shot Language Model Fine-tuning: A Prompt-Based Uncertainty Propagation Approach
In Annual Meeting of the Association for Computational Linguistics (ACL), 2023.
[BibTeX] [arXiv]
Yue Yu, Yuchen Zhuang, Rongzhi Zhang, Yu Meng, Jiaming Shen, and Chao Zhang.
Zero-Shot Text Classification by Training Data Creation with Progressive Dense Retrieval
In Findings of the Association for Computational Linguistics (ACL), 2023.
[BibTeX] [arXiv]
Rongzhi Zhang, Yue Yu, Pranav Shetty, Le Song, and Chao Zhang.
PRBoost: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning
In Annual Meeting of the Association for Computational Linguistics (ACL), 2022.
[BibTeX] [arXiv]
Rongzhi Zhang, Rebecca West, Xiquan Cui, and Chao Zhang.
Adaptive Multi-view Rule Discovery for Weakly-Supervised Compatible Products Prediction
In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2022.
[BibTeX] [arXiv]
Yue Yu, Lingkai Kong, Jieyu Zhang, Rongzhi Zhang, and Chao Zhang.
AcTune: Uncertainty-Aware Active Self-Training for Active Fine-Tuning of Pretrained Language Models
In Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), 2022.
[BibTeX] [arXiv]
Rongzhi Zhang, Yue Yu, and Chao Zhang.
SeqMix: Augmenting Active Sequence Labeling via Sequence Mixup
In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020.
[BibTeX] [Slides]
Experiences
- Research Intern | Jan. 2024 - May 2024
Microsoft Research, Redmond, WA
Hosts: Shuohang Wang, Yelong Shen, Weizhu Chen
- Student Researcher | May 2023 - Dec. 2023
Google Research, New York City, NY
Hosts: Jiaming Shen, Tianqi Liu, Jialu Liu
- Student Researcher | May 2022 - Dec. 2022
Google Research, New York City, NY
Hosts: Jiaming Shen, Tianqi Liu, Michael Bendersky
Awards
- 2023 KDD Student Travel Award, ACM SIGKDD
- 2019 Outstanding Graduate, Zhejiang University
- 2019 Outstanding Undergrad Thesis, Zhejiang University
- 2018 Zhejiang Provincial Government Scholarship (Top 3%)
- 2018 First-class Excellent Undergraduate Scholarship, Zhejiang University (Top 3%)
- 2018 First-class Academic Scholarship, Zhejiang University (Top 3%)
- 2016 Student Innovation Research Funding, Zhejiang Province
- 2014 First Prize, 31st Chinese Physics Olympiad (CPhO)
Services
- Program Committee / Reviewer: TKDE 2020, 2021; EMNLP 2022; ACL 2022; KDD 2023; NeurIPS 2023; ACL Rolling Review 2023, 2024; ICLR 2024.
Miscellaneous
- I played on the Zhejiang University varsity men's basketball team, competing in CUBA Division II.