I'm a Research Scientist at Google DeepMind working on improving Gemini's fundamental capabilities for retrieval and ranking.

I completed my Ph.D. in Statistics at the University of Michigan in 2025, where I was fortunate to be advised by Ambuj Tewari. My Ph.D. was graciously supported by the National Science Foundation Graduate Research Fellowship (NSF GRFP) and the 2025 Apple Scholars in AI/ML PhD Fellowship. Prior to my Ph.D, I double-majored in Computer Science and Chemical Engineering and worked with Mahdi Cheraghchi, Sindhu Kutty, and Andrej Lenert.

My research interests lie in the Foundations of Machine Learning. During my Ph.D, I worked on various topics in learning theory, including online learning, adversarial robustness, differential privacy, and language generation, among other things. Nowadays, I work on post-training for large language models, particularly reasoning and adaptive inference-time compute.

Apart from research, I am a fan of bodybuilding and actively keep up with the Mr. Olympia.

Preprints

Estimating the (Un)seen: Sample-dependent Mass Estimation PDF
with Vitaly Feldman, Satyen Kale, Kunal Talwar, and Ambuj Tewari
Preprint, 2025.
Transductive and Learning-Augmented Online Regression PDF
with Shenghao Xie , Samson Zhou
Preprint, 2025.
Online Boosting for Multilabel Ranking with Top-k Feedback PDF
with Daniel T. Zhang, Young Hun Jung, Ambuj Tewari
Preprint, 2020.

In Submission

GroupDPO: Memory efficient Group-wise Direct Preference Optimization
with Jixuan Leng, Si Si , Hsiang-Fu Yu , Inderjit S Dhillon
In Submission, 2026.
On Generation in Metric Spaces PDF
with Jiaxun Li , Ambuj Tewari
In Submission, 2026.
Optimal Stopping vs Best-of-N for Inference Time Optimization PDF
with Yusuf Kalayci , Shaddin Dughmi
In Submission, 2026.
AdaBoN: Adaptive Best-of-N Alignment PDF
with Hilal Asi , Satyen Kale
In Submission, 2026.

Publications

Large Language Models

AI-rithmetic PDF
with Alex Bie, Travis Dick, Alex Kulesza, Prabhakar Raghavan, Sergei Vassilvitskii
ICLR Workshop on I Can't Believe It's Not Better (ICBINB), 2026.

Language Generation

Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation PDF
with Seamus Somerstep , Unique Subedi , Yuekai Sun
Conference on Artificial Intelligence and Statistics (AISTATS), 2026.
also at Conference on the Mathematical Theory of Deep Neural Networks (DeepMath), 2025.
Generation through the lens of learning theory PDF
with Jiaxun Li , Ambuj Tewari
Conference on Learning Theory (COLT), 2025.
Representative Language Generation PDF
with Charlotte Peale , Omer Reingold
International Conference on Machine Learning (ICML), 2025.
Generation from Noisy Examples PDF
with Ananth Raman
International Conference on Machine Learning (ICML), 2025.

Differential Privacy

Missing Mass for Differentially Private Domain Discovery Oral PDF
with Matthew Joseph , Travis Dick
International Conference on Learning Representations (ICLR), 2026.
Tracking the Best Expert Privately PDF
with Hilal Asi , Aadirupa Saha
International Conference on Machine Learning (ICML), 2025.
Faster Rates for Private Adversarial Bandits PDF
with Hilal Asi, Kunal Talwar
International Conference on Machine Learning (ICML), 2025.

Beyond Worst-case Guarantees for Learning

Online Classification with Predictions PDF
with Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2024.
Smoothed Online Classification can be Harder than Batch Classification PDF
with Unique Subedi, Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2024.
Multiclass Transductive Online Learning Spotlight PDF
with Steve Hanneke, Amirreza Shaeiri, Unique Subedi
Conference on Neural Information Processing Systems (NeurIPS), 2024.
On Proper Learnability between Average- and Worst-case Robustness PDF
with Unique Subedi, Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2023.

Online Learning

The Complexity of Sequential Prediction in Dynamical Systems Oral PDF
with Unique Subedi, Ambuj Tewari
Conference on Learning for Dynamics and Control (L4DC), 2025.
A Unified Theory of Supervised Online Learnability Outstanding PaperPDF
with Unique Subedi, Ambuj Tewari
Conference on Algorithmic Learning Theory (ALT), 2025.
Online Learning with Set-Valued Feedback PDF
with Unique Subedi, Ambuj Tewari
Conference on Learning Theory (COLT), 2024.
Online Infinite-Dimensional Regression: Learning Linear Operators PDF
with Unique Subedi, Ambuj Tewari
Conference on Algorithmic Learning Theory (ALT), 2024.
Multiclass Online Learning and Uniform Convergence PDF
with Steve Hanneke, Shay Moran, Unique Subedi, Ambuj Tewari
Conference on Learning Theory (COLT), 2023.
Online Agnostic Multiclass Boosting PDF
with Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2022.

Partial Feedback

Apple Tasting: Combinatorial Dimensions and Minimax Rates PDF
with Ananth Raman , Unique Subedi, Ambuj Tewari
Conference on Learning Theory (COLT), 2024.
Multiclass Online Learnability under Bandit Feedback PDF
with Ananth Raman , Unique Subedi, Idan Mehalel, Ambuj Tewari
Conference on Algorithmic Learning Theory (ALT), 2024.

Multioutput Learning

A Characterization of Multioutput Learnability PDF
with Unique Subedi, Ambuj Tewari
Journal of Machine Learning Research (JMLR), 2024.
On the Learnability of Multilabel Ranking SpotlightPDF
with Unique Subedi, Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2023.

Other

Design of thermophotovoltaics for tolerance of parasitic absorption PDF
with Tobias Burger, Andrej Lenert
Optics Express, 2019.

Vinod Raman