I’m a Research Scientist at Google DeepMind working on improving Gemini’s capabilities for retrieval and ranking.
I completed my Ph.D. in Machine Learning at the University of Michigan, where I was advised by Ambuj Tewari. Previously, I studied Computer Science and Chemical Engineering also at UM where I worked with Mahdi Cheraghchi, Sindhu Kutty, and Andrej Lenert.
My research interests lie in the Foundations of Machine Learning. In the past, I worked on online learning/bandits, adversarial robustness, and beyond-worst-case analysis for learning algorithms, among other things. Currently, I am interested in post-training for LLMs, and in particular, inference-time methods for prompt-adaptive sampling.
During my Ph.D, I was fortunate to be supported by the National Science Foundation Graduate Research Fellowship (NSF GRFP) and the 2025 Apple Scholars in AI/ML PhD Fellowship.
Click here to see my CV.
Preprints
- Estimating the (Un)seen: Sample-dependent Mass Estimation
with Vitaly Feldman, Satyen Kale, Kunal Talwar, and Ambuj Tewari
Preprint, 2025. - Online Boosting for Multilabel Ranking with Top-k Feedback
with Daniel T. Zhang, Young Hun Jung, Ambuj Tewari
Preprint, 2020.
In Submission
- Optimal Stopping vs Best-of-N for Inference Time Optimization
with Yusuf Kalayci , Shaddin Dughmi
In Submission, 2025. - Transductive and Learning-Augmented Online Regression
with Shenghao Xie , Samson Zhou
In Submission, 2025. - Missing Mass for DIfferentially Private Domain Discovery
with Matthew Joseph , Travis Dick
In Submission, 2025. - AdaBoN: Adaptive Best-of-N Alignment
with Hilal Asi , Satyen Kale
In Submission, 2025.
Publications
Language Generation
Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation
with Seamus Somerstep , Unique Subedi , Yuekai Sun
Conference on the Mathematical Theory of Deep Neural Networks (DeepMath), 2025.Generation through the lens of learning theory
with Jiaxun Li , Ambuj Tewari
Conference on Learning Theory (COLT), 2025.Representative Language Generation
with Charlotte Peale , Omer Reingold
International Conference on Machine Learning (ICML), 2025.Generation from Noisy Examples
with Ananth Raman
International Conference on Machine Learning (ICML), 2025.
Differential Privacy
Tracking the Best Expert Privately
with Hilal Asi , Aadirupa Saha
International Conference on Machine Learning (ICML), 2025.Faster Rates for Private Adversarial Bandits
with Hilal Asi, Kunal Talwar
International Conference on Machine Learning (ICML), 2025.
Beyond Worst-case Guarantees for Learning
Online Classification with Predictions
with Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2024.Smoothed Online Classification can be Harder than Batch Classification
with Unique Subedi, Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2024.Multiclass Transductive Online Learning
with Steve Hanneke, Amirreza Shaeiri, Unique Subedi
Conference on Neural Information Processing Systems (NeurIPS), 2024. Spotlight.On Proper Learnability between Average- and Worst-case Robustness
with Unique Subedi, Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2023.
Online Learning
The Complexity of Sequential Prediction in Dynamical Systems
with Unique Subedi, Ambuj Tewari
Conference on Learning for Dynamics and Control (L4DC), 2025. Oral Presentation.A Unified Theory of Supervised Online Learnability
with Unique Subedi, Ambuj Tewari
Conference on Algorithmic Learning Theory (ALT), 2025. Outstanding Paper Award.Online Learning with Set-Valued Feedback
with Unique Subedi, Ambuj Tewari
Conference on Learning Theory (COLT), 2024.Online Infinite-Dimensional Regression: Learning Linear Operators
with Unique Subedi, Ambuj Tewari
Conference on Algorithmic Learning Theory (ALT), 2024.Multiclass Online Learning and Uniform Convergence
with Steve Hanneke, Shay Moran, Unique Subedi, Ambuj Tewari
Conference on Learning Theory (COLT), 2023.Online Agnostic Multiclass Boosting
with Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2022.
Partial Feedback
Apple Tasting: Combinatorial Dimensions and Minimax Rates
with Ananth Raman , Unique Subedi, Ambuj Tewari
Conference on Learning Theory (COLT), 2024.Multiclass Online Learnability under Bandit Feedback
with Ananth Raman , Unique Subedi, Idan Mehalel, Ambuj Tewari
Conference on Algorithmic Learning Theory (ALT), 2024.
Multioutput Learning
A Characterization of Multioutput Learnability
with Unique Subedi, Ambuj Tewari
Journal of Machine Learning Research (JMLR), 2024.On the Learnability of Multilabel Ranking
with Unique Subedi, Ambuj Tewari
Conference on Neural Information Processing Systems (NeurIPS), 2023. Spotlight.
Other
- Design of thermophotovoltaics for tolerance of parasitic absorption
with Tobias Burger, Andrej Lenert
Optics Express, 2019.
Talks
- Optimal Stopping vs Best-of-N for Inference Time Optimization (Google DeepMind Tech Talk 2025) [slides]
- A Unified Theory of Supervised Online Learnability (ALT 2025)
- Generation through the lens of learning theory (Apple 2025)
- Generation through the lens of learning theory (NEU CS Theory Seminar) [slides]
- Generation through the lens of learning theory (STATS 700 Guest Lecture)
- Trichotomies in Online Learnability (Student ML Research Seminar 2024) [slides]
- Trichotomies in Online Learnability (Apple 2024) [slides]
- Revisiting the Learnability of Apple Tasting (MSSISS 2024)
- Multiclass Online Learnability under Bandit Feedback (ALT 2024)
- Multiclass Online Learning and Uniform Convergence (UM EECS Theory Seminar) [slides]
- On Classification-Calibration of Gamma-Phi Losses (COLT 2023) [slides]
Awards and Fellowships
- Apple Scholars in AI/ML PhD Fellowship (2025)
- ALT Outstanding Paper Award (2025)
- MSSISS Best Oral Presentation (2024)
- Outstanding First-year Ph.D. Student (2022)
- NSF Graduate Research Fellowship (2022)
Hobbies
Apart from research, I am a big fan of bodybuilding and actively keep up with the Mr. Olympia.
