I’m a Research Scientist at Google DeepMind working on improving Gemini’s capabilities for retrieval and ranking.

I completed my Ph.D. in Machine Learning at the University of Michigan, where I was advised by Ambuj Tewari. Previously, I studied Computer Science and Chemical Engineering also at UM where I worked with Mahdi Cheraghchi, Sindhu Kutty, and Andrej Lenert.

My research interests lie in the Foundations of Machine Learning. In the past, I worked on online learning/bandits, adversarial robustness, and beyond-worst-case analysis for learning algorithms, among other things. Currently, I am interested in post-training for LLMs, and in particular, inference-time methods for prompt-adaptive sampling.

During my Ph.D, I was fortunate to be supported by the National Science Foundation Graduate Research Fellowship (NSF GRFP) and the 2025 Apple Scholars in AI/ML PhD Fellowship.

Click here to see my CV.

Preprints

  1. Estimating the (Un)seen: Sample-dependent Mass Estimation
    with Vitaly Feldman, Satyen Kale, Kunal Talwar, and Ambuj Tewari
    Preprint, 2025.
  2. Online Boosting for Multilabel Ranking with Top-k Feedback
    with Daniel T. Zhang, Young Hun Jung, Ambuj Tewari
    Preprint, 2020.

In Submission

  1. Optimal Stopping vs Best-of-N for Inference Time Optimization
    with Yusuf Kalayci , Shaddin Dughmi
    In Submission, 2025.
  2. Transductive and Learning-Augmented Online Regression
    with Shenghao Xie , Samson Zhou
    In Submission, 2025.
  3. Missing Mass for DIfferentially Private Domain Discovery
    with Matthew Joseph , Travis Dick
    In Submission, 2025.
  4. AdaBoN: Adaptive Best-of-N Alignment
    with Hilal Asi , Satyen Kale
    In Submission, 2025.

Publications

Language Generation

  1. Learning to Choose or Choosing to Learn: Best-of-N vs. Supervised Fine-Tuning for Bit String Generation
    with Seamus Somerstep , Unique Subedi , Yuekai Sun
    Conference on the Mathematical Theory of Deep Neural Networks (DeepMath), 2025.

  2. Generation through the lens of learning theory
    with Jiaxun Li , Ambuj Tewari
    Conference on Learning Theory (COLT), 2025.

  3. Representative Language Generation
    with Charlotte Peale , Omer Reingold
    International Conference on Machine Learning (ICML), 2025.

  4. Generation from Noisy Examples
    with Ananth Raman
    International Conference on Machine Learning (ICML), 2025.

Differential Privacy

  1. Tracking the Best Expert Privately
    with Hilal Asi , Aadirupa Saha
    International Conference on Machine Learning (ICML), 2025.

  2. Faster Rates for Private Adversarial Bandits
    with Hilal Asi, Kunal Talwar
    International Conference on Machine Learning (ICML), 2025.

Beyond Worst-case Guarantees for Learning

  1. Online Classification with Predictions
    with Ambuj Tewari
    Conference on Neural Information Processing Systems (NeurIPS), 2024.

  2. Smoothed Online Classification can be Harder than Batch Classification
    with Unique Subedi, Ambuj Tewari
    Conference on Neural Information Processing Systems (NeurIPS), 2024.

  3. Multiclass Transductive Online Learning
    with Steve Hanneke, Amirreza Shaeiri, Unique Subedi
    Conference on Neural Information Processing Systems (NeurIPS), 2024. Spotlight.

  4. On Proper Learnability between Average- and Worst-case Robustness
    with Unique Subedi, Ambuj Tewari
    Conference on Neural Information Processing Systems (NeurIPS), 2023.

Online Learning

  1. The Complexity of Sequential Prediction in Dynamical Systems
    with Unique Subedi, Ambuj Tewari
    Conference on Learning for Dynamics and Control (L4DC), 2025. Oral Presentation.

  2. A Unified Theory of Supervised Online Learnability
    with Unique Subedi, Ambuj Tewari
    Conference on Algorithmic Learning Theory (ALT), 2025. Outstanding Paper Award.

  3. Online Learning with Set-Valued Feedback
    with Unique Subedi, Ambuj Tewari
    Conference on Learning Theory (COLT), 2024.

  4. Online Infinite-Dimensional Regression: Learning Linear Operators
    with Unique Subedi, Ambuj Tewari
    Conference on Algorithmic Learning Theory (ALT), 2024.

  5. Multiclass Online Learning and Uniform Convergence
    with Steve Hanneke, Shay Moran, Unique Subedi, Ambuj Tewari
    Conference on Learning Theory (COLT), 2023.

  6. Online Agnostic Multiclass Boosting
    with Ambuj Tewari
    Conference on Neural Information Processing Systems (NeurIPS), 2022.

Partial Feedback

  1. Apple Tasting: Combinatorial Dimensions and Minimax Rates
    with Ananth Raman , Unique Subedi, Ambuj Tewari
    Conference on Learning Theory (COLT), 2024.

  2. Multiclass Online Learnability under Bandit Feedback
    with Ananth Raman , Unique Subedi, Idan Mehalel, Ambuj Tewari
    Conference on Algorithmic Learning Theory (ALT), 2024.

Multioutput Learning

  1. A Characterization of Multioutput Learnability
    with Unique Subedi, Ambuj Tewari
    Journal of Machine Learning Research (JMLR), 2024.

  2. On the Learnability of Multilabel Ranking
    with Unique Subedi, Ambuj Tewari
    Conference on Neural Information Processing Systems (NeurIPS), 2023. Spotlight.

Other

  1. Design of thermophotovoltaics for tolerance of parasitic absorption
    with Tobias Burger, Andrej Lenert
    Optics Express, 2019.

Talks

  • Optimal Stopping vs Best-of-N for Inference Time Optimization (Google DeepMind Tech Talk 2025) [slides]
  • A Unified Theory of Supervised Online Learnability (ALT 2025)
  • Generation through the lens of learning theory (Apple 2025)
  • Generation through the lens of learning theory (NEU CS Theory Seminar) [slides]
  • Generation through the lens of learning theory (STATS 700 Guest Lecture)
  • Trichotomies in Online Learnability (Student ML Research Seminar 2024) [slides]
  • Trichotomies in Online Learnability (Apple 2024) [slides]
  • Revisiting the Learnability of Apple Tasting (MSSISS 2024)
  • Multiclass Online Learnability under Bandit Feedback (ALT 2024)
  • Multiclass Online Learning and Uniform Convergence (UM EECS Theory Seminar) [slides]
  • On Classification-Calibration of Gamma-Phi Losses (COLT 2023) [slides]

Awards and Fellowships

  • Apple Scholars in AI/ML PhD Fellowship (2025)
  • ALT Outstanding Paper Award (2025)
  • MSSISS Best Oral Presentation (2024)
  • Outstanding First-year Ph.D. Student (2022)
  • NSF Graduate Research Fellowship (2022)

Hobbies

Apart from research, I am a big fan of bodybuilding and actively keep up with the Mr. Olympia.