Welcome to Qining Zhang's Homepage

Qining Zhang (张启宁)
Ph.D. Candidate
Department of Electrical Engineering and Computer Science
University of Michigan, Ann Arbor
Address: 1301 Beal Ave, Ann Arbor, MI, USA, 48105
E-mail: qiningz AT umich Dot edu

About me

Hi! I am a 5th-year Ph.D. student at the Department of Electrical Engineering and Computer Science of the University of Michigan - Ann Arbor, USA. I'm very fortunate to be advised by Prof. Lei Ying. I received M.S. in Mathematics and the Rackham Predoctoral Fellowship from UM. I received my bachelor's degree in Electrical Engineering from Tsinghua University, China, in 2021, mentored by Prof. Jintao Wang and Dr. Haoyue Tang. I also had a wonderful summer in 2024 as an Applied Scientist Intern with the Amazon Search Experience Science team, mentored by Dr. Yi Liu and Dr. Tanner Fiez.

Research

My research explores the mathematical foundation of stochastic systems, algorithms, and general applied probability problems. My main research interests include:

Reinforcement learning and policy optimization;
Sequential decision-making under uncertainty, bandits, experimental design, and online optimization;
Stochastic control and network optimization.

I also co-organized the Michigan RL Seminar series at the University of Michigan -- Ann Arbor.

Publications

Preprints

Efficient Federated Reinforcement Learning from Human Feedback via Zeroth-Order Policy Optimization
Deyi Wang*, Qining Zhang*, Lei Ying.
Equal Contribution, Under Review, 2026.
SP3O: Reinforcement Learning from Segment Preferences without Reward Modeling
Evan Assmus, Qining Zhang, Lei Ying.
Under Review, 2026.

Journals and Journal-Quality Conferences

Sign-SZPO: Provable Preference-based Reinforcement Learning with an Unknown Link Function
Qining Zhang, Lei Ying.
Reinforcement Learning Conference (RLC) and Reinforcement Learning Journal (RLJ), 2026. (Acceptence Rate: 33.9%)
A short version is accepted at the MLxOR Workshop of NeurIPS, 2025.
Multi-Metric Adaptive Experimental Design under Fixed Budget with Validation
Qining Zhang, Tanner Fiez, Yi Liu, Wenyang Liu.
Conference on Artificial Intelligence and Statistics (AISTATS), 2026. (Acceptence Rate: 28.0%)
Fingerprinting and Quantification of Procyanidins via LC-MS/MS and ESI In-Source Fragmentation
Yanxin Lin, Helene Hopfer, Qining Zhang, Misha T. Kwasniewski.
Journal of Agricultural and Food Chemistry, 2025.
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
Qining Zhang, Lei Ying.
International Conference on Learning Representations (ICLR), 2025. (Acceptence Rate: 32.1%)
Reinforcement Learning from Human Feedback without Reward Inference: Model-Free Algorithm and Instance-Dependent Analysis
Qining Zhang, Honghao Wei, Lei Ying.
Reinforcement Learning Conference (RLC) and Reinforcement Learning Journal (RLJ), 2024.
Cost Aware Best Arm Identification
Kellen Kanarios, Qining Zhang, Lei Ying.
Reinforcement Learning Conference (RLC) and Reinforcement Learning Journal (RLJ), 2024.
Deep Reinforcement Learning for Early Diagnosis of Lung Cancer
Yifan Wang, Qining Zhang, Chuan Zhou, Lei Ying.
AAAI Conference on Artificial Intelligence (AAAI), 2024. (Acceptence Rate: 24.2%)
Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms
Qining Zhang, Lei Ying.
Conference on Neural Information Processing Systems (NeurIPS), 2023. (Acceptence Rate: 26.1%)
On Low-Complexity Quickest Intervention of Mutated Diffusion Processes Through Local Approximation
Qining Zhang, Honghao Wei, Weina Wang, Lei Ying.
ACM Mobihoc, 2022. (Acceptence Rate: 20%)
Online Utility Optimization in Multi-User Interference Networks Under a Long-Term Budget Constraint
Yuchao Chen, Jintao Wang, Qining Zhang, Feifei Gao.
IEEE Transactions on Vehicular Technology, 2022.

Other Publications

Early Lung Cancer Diagnosis from Virtual Follow-up LDCT Generation via Correlational Autoencoder and Latent Flow Matching
Yutong Wu, Yifan Wang, Qining Zhang, Chuan Zhou, Lei Ying.
The W3PHIAI-26 Workshop at AAAI and Studies in Computational Intelligence, 2026.
Online Optimizing Multi-user Interference Network Utility with Unknown CSI under Budget Constraint
Yuchao Chen, Jintao Wang, Qining Zhang, Feifei Gao.
IEEE Wireless Communications and Networking Conference (WCNC), 2022.
Minimizing the Age of Synchronization in Power-constrained Wireless Networks with Unreliable Time-varying Channels
Qining Zhang, Haoyue Tang, Jintao Wang.
Age-of-Information Workshops, IEEE INFOCOM, 2020.

Professional Services

Conference Reviewer: ICLR / NeurIPS / AISTATS / INFOCOM / L4DC / RLC / EWRL

Journal Reviewer: IEEE TPAMI / IEEE ToN / Automatica / Performance Evaluation / TMLR